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(57) Abstract: The invention provides a method of 
predicting the responsiveness of a breast cancer patient to 
chemotherapy based on the patient's cellular constituent 
profile comprising measurements of a plurality of 
cellular constituents in a sample derived from the patient, 
the patients estrogen receptor level, and the patient's 
estrogen receptor level relative to the patient's age. The 
invention also provides a method for selecting patients for 
enrollment in a clinical trial of a drug for treating breast 
cancer based on these factors. 
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METHODS OF PREDICTING CHEMOTHERAPY RESPONSIVENESS IN BREAST 
CANCER PATIENTS 



This application claims benefit of U.S. Provisional Patent Application No. 
60/650,365, filed on February 4, 2005, which is incorporated by reference herein in its 
5 entirety. 



1. FIELD OF THE INVENTION 

The present invention relates to a method of predicting chemotherapy responsiveness 
in breast cancer patients. The invention also relates to a method of selecting patients for 
1 0 enrollment in clinical trials of breast cancer drugs. 

2. BACKGROUND OF THE INVENTION 

The increased number of cancer cases reported in the United States, and, indeed, 
around the world, is a major concern. Currently there are only a handful of treatments 
available for specific types of cancer, and these provide no guarantee of success. In order to 
1 5 be most effective, these treatments require not only an early detection of the malignancy, but 
a reliable assessment of the severity of the malignancy. 

The incidence of breast cancer, a leading cause of death in women, has been gradually 
increasing in the United States over the last thirty years. Its cumulative risk is relatively high; 
1 in 8 women are expected to develop some type of breast cancer by age 85 in the United 

20 States. In fact, breast cancer is the most common cancer in women and the second most 
common cause of cancer death in the United States. In 1997, it was estimated that 181,000 
new cases were reported in the U.S., and that 44,000 people would die of breast cancer 
(Parker etal, CA Cancer J. Clin. 47:5-27(1997); Chuet al.,J. Nat. Cancer Inst. 88:1571- 
1 579 (1996)). While mechanism of tumorigenesis for most breast carcinomas is largely 

25 unknown, there are genetic factors that can predispose some women to developing breast 
cancer (Miki etal, Science, 266:66-71(1994)). 

Sporadic tumors, those not currently associated with a known germline mutation, 
constitute the majority of breast cancers. It is also likely that other, non-genetic factors also 
have a significant effect on the etiology of the disease. Regardless of the cancer's origin, 
30 breast cancer morbidity and mortality increases significantly if it is not detected early in its 
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progression. Thus, considerable effort has focused on the early detection of cellular 
transformation and tumor formation in breast tissue. 

A marker-based approach to tumor identification and characterization promises 
unproved diagnostic and prognostic reliability. Typically, the diagnosis of breast cancer 
5 requires histopathological proof of the presence of the tumor. In addition to diagnosis, 
histopathological examinations also provide information about prognosis and selection of 
treatment regimens. Prognosis may also be established based upon clinical parameters such 
as tumor size, tumor grade, the age of the patient, and lymph node metastasis. 

Diagnosis and/or prognosis may be determined to varying degrees of effectiveness by 
10 direct examination of the outside of the breast, or through mammography or other X-ray 
imaging methods (Jatoi, Am. J. Surg. 177:518-524 (1999)). The latter approach is not 
without considerable cost, however. Every time a mammogram is taken, the patient incurs a 
small risk of having a breast tumor induced by the ionizing properties of the radiation used 
during the test. In addition, the process is expensive and the subjective interpretations of a 
15 technician can lead to imprecision. For example, one study showed major clinical 

disagreements for about one-third of a set of mammograms that were interpreted individually 
by a surveyed group of radiologists. Moreover, many women find that undergoing a 
mammogram is a painful experience. Accordingly, the National Cancer Institute has not 
recommended mammograms for women under fifty years of age, since this group is not as 
20 likely to develop breast cancers as are older women. It is compelling to note, however, that 
while only about 22% of breast cancers occur in women under fifty, data suggests that breast 
cancer is more aggressive in pre-menopausal women. 

In clinical practice, accurate diagnosis of various subtypes of breast cancer is 
important because treatment options, prognosis, and the likelihood of therapeutic response all 

25 vary broadly depending on the diagnosis. Accurate prognosis, or determination of distant 
metastasis-free survival could allow the oncologist to tailor the administration of adjuvant 
chemotherapy, with women having poorer prognoses being given the most aggressive 
treatment. Furthermore, accurate prediction of poor prognosis would greatly impact clinical 
trials for new breast cancer therapies, because potential study patients could then be stratified 

30 according to prognosis. Trials could then be limited to patients having poor prognosis, in 
turn making it easier to discern if an experimental therapy is efficacious. 
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To date, no set of satisfactory predictors for prognosis based on the clinical 
information alone has been identified. Many have observed that the ER status has a dominant 
signature in the breast tumor gene expression profiling. See West et al, Proc. Natl. Acad. 
Sci. U.S.A. 98:1 1462 (2001); van 't Veer et al, Nature 415:530 (2002); Sorlie et al, Proc. 

5 Natl Acad. Sci. U.S.A. 100:8418 (2003); Perou et al, Nature 406:747 (2000); Gruvberger et 
al, Cancer Res. 61:5979 (2001); Sotiriouef al, Proc. Natl Acad. Sci. U.S.A. 100:10393 
(2003). It is generally accepted that there is some relationship between patient survival and 
ER status, van de Vijver et al, N. Engl J. Med. 347:1999 (2002); Surowiak et al, Folia 
Histochem. Cytobiol 39:143 (2001); Pichon et al, Br. J. Cancer 73:1545 (1996); Collett et 

10 al, J. Clin. Pathol. 49:920 (1996). BRCA1 mutations are related to the familial cancer 
susceptibility. Biesecker et al.,JAMA 269:1970 (1993); Easton et al, Cancer Surv. 18:95 
(1993). Age is also considered to be a prognosis factor since young cancer patients tend to 
have poor tumors. Maggard et al, J. Surg. Res. 1 13:109 (2003). Lymph node status is a 
factor in deciding the treatment. Eifel et al, J. Natl Cancer Inst. 93:979 (2001). 

1 5 The discovery and characterization of BRCA 1 and BRCA2 has recently expanded our 

knowledge of genetic factors which can contribute to familial breast cancer. Germ-line 
mutations within these two loci are associated with a 50 to 85% lifetime risk of breast and/or 
ovarian cancer (Casey, Curr. Opin. Oncol 9:88-93 (1997); Marcus et al, Cancer 77:697- 
709 (1996)). Only about 5% to 10% of breast cancers, however, are associated with breast 

20 cancer susceptibility genes, BRCA1 and BRCA2. The cumulative lifetime risk of breast 

cancer for women who carry the mutant BRCA1 is predicted to be approximately 92%, while 
the cumulative lifetime risk for the non-carrier majority is estimated to be approximately 
10%. BRCA1 is a tumor suppressor gene that is involved in DNA repair and cell cycle 
control, which are both important for the maintenance of genomic stability. More than 90% 

25 of all mutations reported so far result in a premature truncation of the protein product with 
abnormal or abolished function. The histology of breast cancer in BRCA1 mutation carriers 
differs from that in sporadic cases, but mutation analysis is the only way to find the carrier. 
Like BRCA1, BRCA2 is involved in the development of breast cancer, and like BRCA1 plays 
a role in DNA repair. However, unlike BRCA1, it is not involved in ovarian cancer. 

30 Other genes have been linked to breast cancer, for example c-erb-2 (HER2) and p53 

(Beenken et al, Ann. Surg. 233(5):630-638 (2001). Overexpression of c-erb-2 (HER2) and 
p53 have been correlated with poor prognosis (Rudolph et al, Hum. Pathol. 32(3):3 1 1-319 
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(2001) , as has been aberrant expression products of mdm2 (Lukas et al, Cancer Res. 
61(7):3212-3219 (2001) and cyclinl and p27 (Porter & Roberts, International Publication 
WO98/33450, published August 6, 1998). 

The detection of BRCA1 or BRCA2 mutations represents a step towards the design of 
therapies to better control and prevent the appearance of these tumors. Recently, many 
studies have used gene expression profiling to analyze various cancers, and those studies 
have provided new diagnosis and prognosis information in the molecular level. See 
Zajchowski et al, "Identification of Gene Expression Profiled that Predict the Aggressive 
Behavior of Breast Cancer Cells," Cancer Res. 61:5168 (2001); West et al, "Predicting the 
Clinical Status of Human Breast Cancer by Using Gene Expression Profiles," Proc. Natl. 
Acad. Sci. U.S.A. 98:1 1462 (2001); van 't Veer eta!., "Gene Expression Profiling Predicts the 
Outcome of Breast Cancer," Nature 415:530 (2002); Roberts era/., "Diagnosis and Prognosis 
of Breast Cancer Patients," WO 02/103320; Sorlie et al, Proc. Natl Acad. Sci. U.S.A. 
100:8418 (2003); Perou et al, Nature 406:747 (2000); Khan et al, Cancer Res 58, 5009 
(1998); Golub etal, Science 286, 531 (1999); DeRisi et al, Nat. Genet. 14:457 (1996); 
Alizadeh et al, Nature 403, 503 (2000). Methods for the identification of informative 
genesets for various cancers have also been described. See Roberts et al, "Diagnosis and 
Prognosis of Breast Cancer Patients," WO 02/103320; Golub et al, United States Patent No. 
6,647,341. 

Genesets have been identified that are informative for differentiating individuals 
having, or suspected of having, breast cancer based on estrogen receptor (ER) status, or 
BRCA1 mutation vs. sporadic (i.e., other than BRCAl-type) mutation status. See Roberts et 
al, WO 02/103320; van't Veer et al, Nature 415:530 (2001). Genesets have also been 
identified that enable the classification of sporadic tumor-type individuals as those who will 
likely have no metastases within five years of initial diagnosis (i.e., individuals with a good 
prognosis) or those who will likely have a metastasis within five years of initial diagnosis 
(i.e., those having a poor prognosis). Roberts, supra; van't Veer, supra. 

Roberts et al. WO 02/103320 and van de Vijver et al, N. Engl J. Med. 347:1999 

(2002) describe a 70-gene set, useful for the prognosis of breast cancer, which outperformed 
clinical measures of prognosis, and which showed good potential in selecting good outcome 
patients, thereby avoiding over-treatment. The expression of genes with most predictive 
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value, however, was not homogeneous among poor patients, suggesting the need for 
improvement. 

Although the patterns of gene expression as described in Roberts et al. were 
correlated with existing clinical indicators such as estrogen receptor and BRCA1 status, 
5 clinical measures were not incorporated. Furthermore, although the poor-outcome group in 
particular showed heterogeneity in expression pattern, the best classifier decision rule found 
during these studies was a fairly simple one based on the similarity of a patient profile to the 
average profile of a good-outcome training group. 

It is evident that breast cancer is the result of more than one type of molecular event. 
1 0 Likewise, a variety of other conditions, such as other cancers; non-cancer diseases such as 
diabetes, autoimmune or neurodegenerative disorders, obesity; etc., are also the result of 
more than one molecular event. Moreover, an individual's response to exposure to particular 
environmental conditions, for example, exposure to natural or man-made agents, such as 
toxins, pollutants, drugs, food additives, etc., likely result from more than one molecular 
1 5 event. Thus, there exists a need for improved prognostic methods so that appropriate courses 
of prophylaxis and/or therapy may be provided. Genesets having improved prognostic power 
can be identified by first identifying discrete subsets of individuals based on genotypic or 
phenotypic characteristics relevant to the disease or condition, and then identifying genesets 
informative for prognosis within those subsets of patients. Individuals having the condition, 
20 or who are suspected of having the condition, such as breast cancer, would then be provided 
therapies appropriate to the molecular mechanisms underlying the condition. The present 
invention provides such methods for breast cancer, and for other cancers, diseases or 
conditions. 

3. SUMMARY OF THE INVENTION 

25 The present invention provides a method for predicting responsiveness of a breast 

cancer patient to chemotherapy based on one or more of the following (i) a cellular 
constituent profile comprising measurements of a plurality of cellular constituents in a sample 
derived from said patient, (ii) an ER level, said ER level being a level of an estrogen receptor 
transcript or estrogen receptor protein in a sample derived from said patient, wherein ER + 

30 designates a high ER level and ER" designates a low ER level, and (iii) an ER/AGE, said 
ER/AGE being a metric of said ER level relative to the age of said patient, said method 
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comprising predicting said patient to exhibit (a) enhanced response to chemotherapy as 
compared to patients in the general population of breast cancer patients if said cellular 
constituent profile is a poor prognosis profile, and said ER level is ER + , and said ER/AGE is 
low; or (b) reduced response to chemotherapy as compared to patients in the general 
5 population of breast cancer patients if (i) said ER level is ER", or (ii) said cellular constituent 
profile is a poor prognosis profile, and said ER level is ER + , and said ER/AGE is high; 
wherein said cellular constituent profile is classified as a poor prognosis profile if said 
cellular constituent profile has a low similarity to a good prognosis template or has a high 
similarity to a poor prognosis template, said good prognosis template comprising 
1 0 measurements of said plurality of cellular constituents representative of levels of said cellular 
constituents in a plurality of good outcome patients and said poor prognosis template 
comprising measurements of said plurality of cellular constituents representative of levels of 
said cellular constituents in a plurality of poor outcome patients, wherein a good outcome 
patient is a breast cancer patient who has non-occurrence of metastases within a first period 
1 5 of time after initial diagnosis and a poor outcome patient is a patient who has occurrence of 
metastases within a second period of time after initial diagnosis. 

In one embodiment, said cellular constituent profile is determined as a poor prognosis 
profile, and said ER level is determined as ER + , and said ER/AGE is determined as low, and 
said patient is predicted to have enhanced response to chemotherapy as compared to patients " 

20 in the general population of breast cancer patients. In another embodiment, said cellular 
constituent profile is determined as a poor prognosis profile, said ER level is determined as 
ER + , and said ER/AGE is determined as high, and said patient is predicted to have reduced 
response to chemotherapy as compared to patients in the general population of breast cancer 
patients. In still another embodiment, said ER level is determined as ER", and said patient is 

25 predicted to have reduced response to chemotherapy as compared to patients in the general 
population of breast cancer patients. 

The invention also provides a method for selecting a patient for enrollment in a 
clinical trial of a drug for treating breast cancer based on one or more of the following (i) a 
cellular constituent profile comprising measurements of a plurality of cellular constituents in 
30 a sample derived from said patient, (ii) an ER level, said ER level being a level of an estrogen 
receptor transcript or estrogen receptor protein in a sample derived from said patient, wherein 
ER + designates a high ER level and ER" designates a low ER level, and (iii) an ER/AGE, said 
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ER/AGR being a metric of said ER level relative to the age of said patient, said method 
comprising selecting a patient for inclusion in said clinical trial if (a) said ER level is ER"; or 
(b) said cellular constituent profile is a poor prognosis profile, said ER level is ER + , and said 
ER/AGE is high; wherein said cellular constituent profile is classified as a poor prognosis 
5 profile if said cellular constituent profile has a low similarity to a good prognosis template or 
has a high similarity to a poor prognosis template, said good prognosis template comprising 
measurements of said plurality of cellular constituents representative of levels of said cellular 
constituents in a plurality of good outcome patients and said poor prognosis template 
comprising measurements of said plurality of cellular constituents representative of levels of 
1 0 said cellular constituents in a plurality of poor outcome patients, wherein a good outcome 
patient is a breast cancer patient who has non-occurrence of metastases within a first period 
of time after initial diagnosis and a poor outcome patient is a breast cancer patient who has 
occurrence of metastases within a second period of time after initial diagnosis. 

In one embodiment, said ER level is determined as ER", and said patient is selected. 
15 In another embodiment, said cellular constituent profile is determined as a poor prognosis 
profile, said ER level is determined as ER + , and said ER/AGE is determined as high, and said 
patient is selected. 

The invention also provides a method for identifying a breast cancer patient as a good 
or poor candidate for chemotherapy based on one or more of the following (i) a cellular 

20 constituent profile comprising measurements of a plurality of cellular constituents in a sample 
derived from said patient, (ii) an ER level, said ER level being a level of an estrogen receptor 
transcript or estrogen receptor protein in a sample derived from said patient, wherein ER + 
designates a high ER level and ER" designates a low ER level, and (iii) an ER/AGE, said 
ER/AGE being a metric of said ER level relative to the age of said patient, said method 

25 comprising (a) determining whether said cellular constituent profile is a poor prognosis 
profile, said ER level is ER + , and said ER/AGE is low, wherein if said cellular constituent 
profile is a poor prognosis profile, said ER level is ER + , and said ER/AGE is low, said breast 
cancer patient is identified as a good candidate for chemotherapy; or (b) determining whether 
said ER level is ER", whereby if said ER level is ER", said the breast cancer patient is 

30 identified as a poor candidate for chemotherapy; or (c) determining whether said cellular 

constituent profile is a poor prognosis profile, said ER level is ER + , and said ER/AGE is high, 
wherein if said cellular constituent profile is a poor prognosis profile, said ER level is ER + , 
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and said ER/AGE is high, said breast cancer patient is identified as a poor candidate for 
chemotherapy. 

In one embodiment, said cellular constituent profile is determined as a poor prognosis 
profile, said ER level is determined as ER + , and said ER/AGE is determined as low, and said 

5 breast cancer patient is identified as a good candidate for chemotherapy. In another 
embodiment, said ER level is determined as ER", and said the breast cancer patient is 
identified as a poor candidate for chemotherapy. In still another embodiment, said cellular 
constituent profile is determined as a poor prognosis profile, said ER level is determined as 
ER + , and said ER/AGE is determined as high, and said breast cancer patient is identified as a 

1 0 poor candidate for chemotherapy. 

In embodiments of the methods of the invention, said first period is 10 years and said 
second period is 10 years. In another embodiment, said patient is under the age of 55, and 
wherein each said good outcome patient and each said poor outcome patient are under the age 
of 55 at time of diagnosis of breast cancer. 

15 In one embodiment, any one of the methods of the invention further comprises 

determining said cellular constituent profile, said ER level, and/or, said ER/AGE. 

In another embodiment, in any one of the methods of the invention said cellular 
constituent profile comprises measurements of a plurality of transcripts in a sample derived 
from said patient, wherein said good prognosis template comprises measurements of said 
20 plurality of transcripts representative of expression levels of said transcripts in said plurality 
of good outcome patients, and wherein said poor prognosis template comprises measurements 
of said plurality of transcripts representative of expression levels of said transcripts in said 
plurality of poor outcome patients. 

In one embodiment, said cellular constituent profile is a differential expression profile 
25 comprising differential measurements of said plurality of transcripts in said sample derived 
from said patient versus measurements of said plurality of transcripts in a control sample. In 
one embodiment, said differential measurements is selected from the group consisting of 
xdev, log(ratio), error-weighted log(ratio), and mean subtracted log(intensity). 

In another embodiment, said cellular constituent profile comprises measurements of a 
30 plurality of protein species in a sample derived from said patient, wherein said good 
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prognosis template comprises measurements of said plurality of protein species representative 
of levels of said protein species in said plurality of good outcome patients, and wherein said 
poor prognosis template comprises measurements of said plurality of protein species 
representative of levels of said protein species in said plurality of poor outcome patients. 



template is an average of expression levels of said transcript in said plurality of good 
outcome patients. 

In one embodiment, similarity of said cellular constituent profile to said good 
prognosis template is represented by a correlation coefficient between said cellular 
1 0 constituent profile and said good prognosis template, wherein said correlation coefficient 
greater than a correlation threshold, e.g., 0.5, indicates a high similarity and said correlation 
coefficient equal to or less than said correlation threshold indicates a low similarity. 

In another embodiment, similarity of said cellular constituent profile to said good 
prognosis template is represented by a distance between said cellular constituent profile and 
1 5 said good prognosis template, wherein said distance less than a given value indicates a high 
similarity and said distance equal to or greater than said given value indicates a low 
similarity. 

In one embodiment, said ER level is determined by measuring an expression level of 
a gene encoding said estrogen receptor in said patient relative to expression level of said gene 
20 in said control sample, and wherein said ER level is classified as ER + if logl O(ratio) of said 
expression level is greater than -0.65, and wherein said ER level is classified as ER' if 
logl0(ratio) of said expression level is equal to or less than -0.65. In one embodiment, said 
gene encoding said estrogen receptor is the estrogen receptor a gene. 

In one embodiment, said ER/AGE is classified as high if said ER level as measured 
25 by logl 0(ratio) is greater than c (AGE - d), and wherein said ER/AGE is classified as low if 
said ER level is equal to or less than c-{ AGE - d), wherein c is a coefficient, AGE is the age 
of said patient, and d is an age threshold. 

In one embodiment, said estrogen receptor level is measured by an oligonucleotide 
probe that detects a transcript corresponding to the gene having accession number 



5 



In one embodiment, measurement of each said transcript in said good prognosis 
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NM_000125, wherein said control sample is apool of breast cancer cells of different patients, 
and wherein c = 0. 1 and d = 42.5. 

In one embodiment, said control sample is generated by pooling together cDNAs of 
said plurality of transcripts from a plurality of breast cancer patients. In another embodiment, 
said control sample is generated by pooling together synthesized cDNAs of said plurality of 
transcripts and said transcript of said gene encoding said estrogen receptor. 

In one embodiment, said plurality of transcripts consists of transcripts corresponding 
to at least a portion of the set of genes listed in Table 8. In another embodiment, said 
plurality of transcripts consists of all transcripts corresponding to genes listed in Table 8. 

In one embodiment, said chemotherapy is carried out using the CMF combination 
consisting of cyclophosphamide, methotrexate, and 5-fluorouracil. 

The invention also provides a computer system comprising a processor, and a 
memory coupled to said processor and encoding one or more programs that cause the 
processor to carry out any one of the methods of the invention. 

The invention also provides computer program product for use in conjunction with a 
computer having a processor and a memory connected to the processor, said computer 
program product comprising a computer readable storage medium having a computer 
program mechanism encoded thereon, which computer program mechanism may be loaded 
into the memory of said computer and cause said computer to carry out any one of the 
methods of the invention. 

The invention also provides a method for predicting responsiveness of a breast cancer 
patient to chemotherapy based on one or more of the following (i) a cellular constituent 
profile comprising measurements of a plurality of cellular constituents in a sample derived 
from said patient, (ii) an ER level, said ER level being a level of an estrogen receptor 
transcript or estrogen receptor protein in a sample derived from said patient, wherein ER + 
designates a high ER level and ER' designates a low ER level, and (iii) an ER/AGE, said 
ER/AGE being a metric of said ER level relative to the age of said patient, said method 
comprising predicting said patient to exhibit (a) enhanced response to chemotherapy as 
compared to patients in the general population of breast cancer patients if said cellular 
constituent profile is a poor prognosis profile, and said ER level is ER + , and said ER/AGE is 
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low; or (b) reduced response to chemotherapy as compared to patients in the general 
population of breast cancer patients if (i) said ER level is ER\ or (ii) said cellular constituent 
profile is a poor prognosis profile, and said ER level is ER + , and said ER/AGE is high, 
wherein said cellular constituent profile is classified as a good prognosis profile if said 

5 cellular constituent profile predicts non-occurrence of metastases in said breast cancer patient 
within a predetermined period of time after initial diagnosis, and wherein said cellular 
constituent profile is classified as a poor prognosis profile if said cellular constituent profile 
predicts occurrence of metastasis within said predetermined period of time. In one 
embodiment, the predetermined period of time is 5 or 10 years. 

1 0 In one embodiment, said cellular constituent profile comprises measurements of a 

plurality of transcripts in a sample derived from said patient. In another embodiment, said 
cellular constituent profile is a differential expression profile comprising differential 
measurements of said plurality of transcripts in said sample derived from said patient versus 
, measurements of said plurality of transcripts in a control sample. In one embodiment, said 

15 control sample is generated by pooling together synthesized cDNAs of said plurality of 
transcripts. 

In a preferred embodiment, said plurality of transcripts consists of transcripts 
corresponding to at least a portion, e.g., at least 5, 10, 40, 50 or all, of the genes listed in 
Table 8. 

20 In one embodiment, said cellular constituent profile is classified as a good prognosis 

profile or a poor prognosis profile using an artificial neural network, which receives said 
cellular constituent profile as an input and generates an output comprising data indicating 
whether said cellular constituent profile is a good prognosis profile or a poor prognosis 
profile. 

25 In another embodiment, said cellular constituent profile is classified as a good 

. prognosis profile or a poor prognosis profile using a supporting vector machine, which 
receives said cellular constituent profile as an input and generates an output comprising data 
indicating whether said cellular constituent profile is a good prognosis profile or a poor 
prognosis profile. 

30 4. BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 depicts the decision tree that resulted in the five patient subsets used to identify 
informative prognosis-related genes. 
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FIG. 2 shows the relationship between ER level and age. (A) Scatter plot of ER vs. 
age for ER+ patients. Black dots indicate metastases free samples, and gray dots indicate 
metastases samples. It appears that patients of ER+ group can be subdivided into "ER+, 
ER/AGE high" group (above the black line) and "ER+, ER/AGE low" (below the black line) 
5 group. The black line is approximated by ER = 0. 1 *(AGE-42.5), and the dashed line by ER 
= 0.1*(age-50). Within each population, the ER level also increases with age. (B) Age 
distribution of all patients in ER+ samples. A bimodal distribution is observed. (C) ER- 
modulated age (age - 10*) distribution of all patients in ER+ samples. A bimodal 
distribution is observed. (D) Age distribution of samples with metastasis. (E) ER-modulated 
1 0 age distribution of samples with metastasis. The three peaks appearing in this distribution 
suggest a polymorphism. 

FIG. 3. Performance of classifier for the "ERVsporadic" group. (A) Error rate 
obtained from leave-one-out cross validation (LOOCV) for predicting the disease outcome as 
a function of the number of reporter genes used in the classifier. (B) Scatter plot between 

1 5 correlation to good group (X axis) and to poor group (Y axis). Circles indicate metastases- 
free samples, squares indicate samples with metastases. Dashed line: threshold for separating 
poor from good. (C) Error rate calculated with respect to good outcome group (good 
outcome misclassified as poor divided by total number of good), or poor outcome group 
(poor outcome misclassified as good divided by total number of poor), or the average of the 

20 two rates. 

FIG. 4. Performance of classifier for the "ER+, ER/AGE high" group. (A) Error rate 
obtained from leave-one-out cross validation (LOOCV) for predicting the disease outcome as 
a function of the number of reporter genes used in the classifier. (B) Scatter plot between 
correlation to good group (X axis) and to poor group (Y axis). Circles indicate metastases- 
25 free samples, and squares indicate samples with metastases. Dashed line: threshold for 
separating poor from good. (C) Error rate calculated with respect to good outcome group 
(good outcome misclassified as poor divided by total number of good), or poor outcome 
group (poor outcome misclassified as good divided by total number of poor), or the average 
of the two rates. 

30 FIG. 5. Performance of classifier for the "ER+, ER/AGE low/LN~" group. (A) Error 

rate obtained from leave-one-out cross validation (LOOCV) for predicting the disease 
outcome as a function of the number of reporter genes used in the classifier. (B) Scatter plot 
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between correlation to good group (X axis) and to poor group (Y axis). Circles indicate 
metastases-free samples, and squares indicates samples with metastases. Dashed line 
indicates the threshold for separating poor from good. (C) Error rate calculated with respect 
to good outcome group (good outcome misclassified as poor divided by total number of 
5 good), or poor outcome group (poor outcome misclassified as good divided by total number 
of poor), or the average of the two rates. 

FIG. 6. Performance of classifier for the "ER+, ER/AGE low/LN+" group. (A) Error 
rate obtained from leave-one-out cross validation (LOOCV) for predicting the disease 
outcome as a function of the number of reporter genes used in the classifier. (B) Scatter plot 

10 between correlation to good group (X axis) and to poor group (Y axis). Circles indicate 
metastases free samples, squares indicate samples with metastases. Dashed line: threshold 
for separating poor from good. (C) Error rate calculated with respect to good outcome group 
(good outcome misclassified as poor divided by total number of good), or poor outcome 
group (poor outcome misclassified as good divided by total number of poor), or the average 

15 of the two rates. 

FIG. 7. Performance of classifier for the "ER~, BRCA1" group. (A) Error rate 
obtained from leave-one-out cross validation (LOOCV) for predicting the disease outcome as 
a function of the number of reporter genes used in the classifier. (B) Scatter plot between 
■ correlation to good group (X axis) and to poor group (Y axis). Circles indicate metastases 
20 free samples, squares indicate samples with metastases. Dashed line: threshold for separating 
poor from good. (C) Error rate calculated with respect to good outcome group (good 
outcome misclassified as poor divided by total number of good), or poor outcome group 
(poor outcome misclassified as good divided by total number of poor), or the average of the 
two rates. 

25 FIG. 8. Heatmaps of genes representing key biological functions in subgroups of 

patients: A: Cell cycle genes are predictive of outcome in patients with ER/age high. B: Cell 
cycle genes are not predictive of outcome in "ER- and sporadic" patients C: Glycolysis genes 
are predictive of outcome in patients with ER/age low and LN-. D: Glycolysis genes are not 
predictive of outcome in "ER- & BRCA1" patients. 

30 FIG. 9 shows metastasis free probability for the entire set of breast cancer patients 

younger than 55 years of age, with or without chemotherapy. 
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FIG. 10 shows the effect of chemotherapy in (a) patients with "good signature" as 
predicted by the "70 gene" classifier, and (b) patients with "poor signature". 

FIG. 1 1 shows the effect of chemotherapy in (a) ER- patients, and (b) ER+ patients 
with "poor signature". 

5 FIG. 12 ER+ patients are divided into "ER/age high" (above the line) and "ER/age 

low" (below the line) group according to their ER expression level relative to patient age at 
diagnosis. The ER expression level is measured by gene expression array. The control 
channel is the pool of all breast cancer samples. The line is represented as y = 0.1*(x-42.5). 

FIG. 13 shows the effect of chemotherapy in "poor signature" patients in (a) "ER/age 
10 high" group, and (b) "ER/age low" group. 

5. DETAILED DESCRIPTION OF THE INVENTION 

The invention provides a method of predicting the responsiveness of a breast cancer 
patient to chemotherapy based on the patient' s cellular constituent profile comprising 
measurements of a plurality of cellular constituents in a sample derived from the patient, the 

1 5 patient's estrogen receptor level (hereinafter designated as "ER level"), and the patient's 

estrogen receptor level relative to the patient's age (hereinafter designated as "ER/AGE"). In 
the method of the invention, a patient is classified into an appropriate chemotherapy 
responsiveness group as compared to patients in the general population of breast cancer 
patients. The inventors have discovered that, among patients whose cellular constituent 

20 profile indicates poor prognosis, a patient's responsiveness to chemotherapy depends not only 
on the patient's ER level, but also on the change of the ER level with age. The inventors 
have found that patients who show high ER level at an earlier age (thus a high ER/AGE) 
show little response to chemotherapy, whereas patients who show high ER level at later age 
(thus a low ER/AGE) show increased response to chemotherapy. In particular, the inventors 

25 have found that a patient exhibits enhanced response to chemotherapy as compared to 

patients in the general population of breast cancer patients if the patient's cellular constituent 
profile indicates poor prognosis, and the patient's ER level is ER + , and the patient's estrogen 
receptor level relative to the patient's age ER/AGE is low, whereas a patient exhibits reduced 
response to chemotherapy as compared to patients in the general population if (i) the patient's 

30 ER level is ER', or (ii) the patient's cellular constituent profile indicates poor prognosis, and 
the patient's ER level is ER + , and the patient's estrogen receptor level relative to the patient's 
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age ER/AGE is high. Herein, ER + designates a high ER level and ER' designates a low ER 
level. In addition, the inventors have also found that if a patient's cellular constituent profile 
indicates a good prognosis, then the patient does not need chemotherapy. The method of the 
invention is particularly useful for predicting the responsiveness of a breast cancer patient 
5 under the age of 55. 

The invention also provides a method for selecting patients for enrollment in a clinical 
trial of a drug for treating breast cancer based on each patient's cellular constituent profile, 
the patient's estrogen receptor level, and the patient's estrogen receptor level relative to the 
patient's age. In the method of the invention, patients having reduced responsiveness to 
1 0 chemotherapy as compared to patients in the general population of breast cancer patients are 
preferably selected for inclusion in a clinical trial. In one embodiment, a patient who has 
either ER" or a combination of (a) poor prognosis, (b) ER + , and (c) ER/AGE high are selected 
for inclusion in the trial. Such patients are predicted not to exhibit significant response to 
chemotherapy, and thus are good candidates in determining efficacy of a new breast cancer 



Patient groups can be classified according to at least one of age, lymph node (LN) 
status, estrogen receptor (ER) level, and BRCA1 mutation status into discrete patient subsets. 
These clinical factors have been implicated in tumor etiology as well as differences in disease 
outcome. These characteristics are not limiting; other genotypic or phenotypic characteristics 

20 of breast cancer, for example, tumor grade, tumor size, tumor cell type, etc., may be used, 
alone or in combination with those listed herein, in order to classify individuals. The 
differences in gene expression or in tumor fate related to these parameters likely represent 
differences in tumor origin and tumor genesis, and are therefore good candidates for tumor 
stratification. Genesets informative for prognosis within each subset are then identified. 

25 New breast cancer patients are then classified using the same criteria, and a prognosis is made 
based on the geneset specific for the patient subset into which the patient falls. In the process 
of constructing a prognosis classifier within each patient subset, particular attention is paid to 
the homogeneous patterns related to the tumor outcome. Emergence of such homogeneous 
prognosis patterns may indicate the most common mechanism to metastasis within a subset. 

30 At the same time, successful identification of such patterns also justifies the parameters being 
used for the tumor stratification. To differentiate this approach from an mRNA-alone 



15 



drug. 
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approach, we refer to the current approach of integrating clinical data with the gene 
expression data as a "comprehensive prognosis". 

The measurements in the cellular constituent profiles can be any suitable measured 
values of the cellular constituents, e.g., measurements of the expression levels of genes. The 
measurement of the expression level of a gene can be direct or indirect, e.g., directly of 
abundance levels of RNAs or proteins or indirectly, by measuring abundance levels of 
cDNAs, amplified RNAs or DNAs, proteins, or activity levels of RNAs or proteins, or other 
molecules (e.g., a metabolite) that are indicative of the foregoing. In one embodiment, the 
profile comprises measurements of abundances of the transcripts of the marker genes. The 
measurements of transcripts can be direct (of the transcripts themselves) or indirect (of, e.g., 
their cDNAs). The measurement of abundance can be a measurement of the absolute 
abundance of a gene product. The measurement of abundance can also be a value 
representative of the absolute abundance, e.g., a normalized abundance value (e.g., an 
abundance normalized against the abundance of a reference gene product) or an averaged 
abundance value (e.g., average of abundances obtained at different time points or from 
different tumor cell samples from the patients, or average of abundances obtained using 
different probes, etc.), or a combination of both. As an example, the measurement of 
abundance of a gene transcript can be a value obtained using an Affymetrix® GeneChip® to 
measure hybridization to the transcript. 

In another embodiment, the expression profile is a differential expression profile 
comprising differential measurements of a plurality of transcripts in a sample derived from 
the patient versus measurements of the plurality of transcripts in a reference sample, e.g., a 
cell sample of normal cells. Each differential measurement in the profile can be but is not 
limited to an arithmetic difference, a ratio, or a log(ratio). As an example, the measurement 
of abundance of a gene transcript can be a value for the transcript obtained using a cDNA or 
ink-jet polynucleotide array in a two-color measurement. 

As used herein, "BRCA1 tumor" or "BRCA1 type" means a tumor having cells 
containing a mutation of the BRCA1 locus. 

A "patient subset" is a group of individuals, all of whom have a particular condition, 
or are subject to a particular condition, which is distinguished from other individuals having 
that condition by one or more phenotypic, genotypic or clinical characteristics of the 
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condition, or of a response to the condition. For example, where the condition is breast 
cancer, individuals may belong to an "ER + " or an "ER~" patient subset, or may belong to a 
particular age group patient subset. 

A gene and/or marker is "informative" for a condition, phenotype, genotype or 
5 clinical characteristic if the expression of the gene or marker is correlated or anticorrelated 
with the condition, phenotype, genotype or clinical characteristic to a greater degree than 
would be expected by chance. 

The patient's estrogen receptor level relative to the patient's age can be measured 
using an appropriate metric designated as ER/AGE. An individual of a given age can be 

1 0 classified as "ER/AGE high" if the individual's ER level is higher than a threshold value for 
the given age. The threshold can be age-dependent, i.e., a different threshold for each 
different age. In one embodiment, the age-dependent threshold value is calculated as c • 
(AGE - d), where c is a coefficient, AGE is the age of the patient, and d is an age threshold. 
The parameters c and d depend on the particular measures and/or units of the ER level and 

1 5 AGE; They can be determined by fitting patients' ER level-age distribution to a bimodal 
distribution of two subgroups each having a different ER level-age dependence. In a specific 
embodiment, c = 0.1 and d = 42.5 is used for ER levels represented by a log(ratio) of ER 
expression level. Thus, for example, the threshold for a 45-year old individual in this 
embodiment is 0.1 (45-42.5), or 0.25, and if the log(ratio) of ER expression level of the 

20 individual is equal to or greater than 0.25, the individual is classified as "ER/AGE high"; 
otherwise, the individual is classified as "ER/AGE low." 

5.1 IDENTIFICATION OF DIAGNOSTIC AND PROGNOSTIC MARKER SETS 
The sets of genes and/or markers that can be used in conjunction with the present 
invention for diagnosis and/or prognosis of breast cancer can be identified using methods 
25 described in this section. In preferred embodiments, the method involves first stratifying 
breast cancer patients according to phenotypic, genotypic and/or clinical parameters into 
subsets, and then identifying markers that discriminate diagnosis and/or prognosis in each 
subsets. 



5.1.1 IDENTIFICATION OF CONDITION SUBSETS 



30 



In one embodiment, the subsets are distinguished by phenotypic, genotypic, and/or 
clinical characteristics of breast cancer. In this embodiment, groups of individuals are 
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classified according to one or more phenotypic, genotypic, or clinical characteristics of breast 
cancer into different patient subsets. At any step in the process of subdividing a patient 
population into patient subsets, the expression level of one or more genes may be determined 
in order to identify whether a prognosis-informative set of genes may be identified for the 

5 particular patient subset. If a gene set is identified for a subset of patients, but is not as 

informative as desired, the patient subset may be further divided and a new geneset identified. 
These subsets may be further subdivided. For example, a group of individuals affected by 
breast cancer may be classified first on the basis of a phenotypic, genotypic or clinical 
characteristic A into subsets SI and S2. The levels of expression of a plurality of genes are 

.0 then determined in tumor samples taken from individuals that fall within subsets SI or S2 in 
order to identify sets of genes informative for prognosis within these subsets. Subsets SI and 
S2 may then each be subdivided into two or more subsets based on other phenotypic, 
genotypic or clinical characteristics. The basis for subdivision, if performed, need not be the 
same for SI and S2. For example, in various embodiments, SI is not subdivided, while S2 is 

15 subdivided on the basis of characteristic B; or SI is subdivided based on characteristic B 
while S2 is not subdivided; or SI and S2 are both subdivided on the basis of characteristic B; 
or SI is subdivided based on characteristic B, while S2 is subdivided according to 
characteristic C; and so on. For a particular decision matrix leading to a plurality of patient 
subsets, the preferred outcome is a prognosis-informative set of genes for each patient subset. 

10 Different decision matrices may lead to different patient subsets, which, in turn, may result in 
different sets of prognosis-informative genes. 

In the specific example, a plurality of phenotypic, genotypic or clinical indications are 
used to classify a breast cancer patient as being a member of one of a plurality of patient 
subsets, wherein the subsets are medically, biochemically or genetically relevant to breast 

25 cancer. For example, a group of patients may be classified into patient subsets based on 
criteria including, but not limited to, estrogen receptor (ER) status, type of tumor (i.e., 
BRCAl-typQ or sporadic), lymph node status, grade of cancer, invasiveness of the tumor, or 
age. "BRCAl-type" indicates that the BRCA1 mutation is present. In each classification 
step, a group of cancer patients may be classified into only two classes, for example, ER+ or 

30 ER - , or into three or more subsets (for example, by tumor grade), depending upon the 
characteristic used to determine the subsets. As used herein, "ER+" indicates that the 
estrogen receptor is expressed at some level; for example, it may indicate that the estrogen 
receptor is detectably expressed, or may indicate that more than 10% of cells may be 
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histologically stained for the receptor, etc. Conversely, "ER-" indicates that the estrogen 
receptor is expressed at a reduced level or not at all; for example, it may indicate that the 
receptor is not detectably expressed, or that 10% or less of cells may be histologically stained 
for the receptor, etc. Marker gene sets optimized for each phenotypic class are preferably 
5 determined after the subsets are established. Where informative markers for a particular 
patient subset, distinguished from another subset by a particular characteristic of the 
condition of interest, cannot be determined, the subset may be further divided by another 
characteristic of the condition to create a plurality of second patient subsets, whereupon genes 
informative for these second patient subsets may be identified. 

1 0 FIG. 1 depicts the process, described in the Examples, of subdivision of a collection 

of breast cancer patients according to phenotypic and genotypic characteristics relevant to 
breast cancer, in preparation for identification of genes informative for prognosis. A 
collection of breast cancer tumor samples was first subdivided by estrogen receptor status. 
ER status was chosen because the presence or absence of the estrogen receptor greatly 

1 5 influences the expression of other genes. In the ER+ patient subset, it was noted that patients 
appeared to be bimodally distributed by ER level vs. age; that is, ER level dependence upon 
age tended to fall within two classes, as separated by the solid line in FIG. 2A. This 
bimodality was used to further subdivide ER+ individuals into "ER+, ER/AGE high" 
individuals and "ER+, ER/AGE low" individuals. A set of informative genes was identified 

20 for the ER+, ER/AGE high patient subset. An informative set was not identified for the ER+, 
ER/AGE low subset, however, so the subset of patients was further divided into LN+ and 
LN- individuals. 



outcome, such as prognosis, may be identified. The methods make use of measured cellular 
constituent profiles, e.g., expression profiles of a plurality of genes (e.g., measurements of 
abundance levels of the corresponding gene products), in tumor samples from a plurality of 
patients in the patient subset whose prognosis outcomes are known. The prognosis outcomes 
30 can be the prognosis at a predetermined time after initial diagnosis. The predetermined time 
can be any convenient time period, e.g., 2, 3, 4, or 5 years. Prognosis markers can be 
obtained by identifying genes whose expression levels correlate with prognosis outcome, e.g., 



5.1.2 IDENTIFICATION OF MARKER SETS INFORMATIVE FOR PATIENT 
SUBSETS 



25 



Once a patient subset is identified, markers, such as genes, informative for a particular 
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genes whose expression levels in good prognosis patients group are significantly different 
from those in poor prognosis patients. In preferred embodiments, the tumor samples from the 
plurality of patients are separated into a good prognosis group and a poor prognosis group for 
the predetermined time period. Genes whose expression levels exhibit differences between 
5 the good and poor prognosis groups to at least a predetermined level are selected as the genes 
whose expression levels correlate with patient prognosis. This section describes 
embodiments which employ genes and gene-derived nucleic acids as markers. However, it 
will be understood by a person skilled in the art that proteins or other cellular constituents 
may also be used as markers. 

10 In a preferred embodiment, the expression profile is a differential expression profile. 

Each measurement in the profile is a differential expression level of a marker in a breast 
tumor sample versus that in a reference sample (also termed a standard or control sample). In 
one embodiment, the reference sample comprises polynucleotide molecules, derived from 
one or more samples from a plurality of normal individuals. For example, the normal 

1 5 individuals may be persons not afflicted with breast cancer. The standard or control may 
alternatively comprise polynucleotide molecules, derived from one or more samples derived 
from individuals having a different form or stage of breast cancer; a different disease or 
different condition, or individuals exposed or subjected to a different condition, than the 
individual from which the sample of interest was obtained. The reference or control may be a 

20 sample, or set of samples, taken from the individual at an earlier time, for example, to assess 
the progression of a condition, or the response to a course of therapy. 

In a preferred embodiment, the standard or control is a pool of target polynucleotide 
molecules derived from a plurality of different individuals. However, where protein levels, 
or the levels of any other relevant biomolecule, are to be compared, the pool may be a pool of 
25 proteins or the relevant biomolecule. In a preferred embodiment in the context of breast 

cancer, the pool comprises samples taken from a number of individuals having sporadic-type 
tumors. 

In another preferred embodiment, the pool comprises an artificially-generated 
population of nucleic acids designed to approximate the level of nucleic acid derived from 
30 each marker found in a pool of marker-derived nucleic acids derived from tumor samples. In 
another embodiment, the pool, also called a "mathematical sample pool," is represented by a 
set of expression values, rather than a set of physical polynucleotides; the level of expression 
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of relevant markers in a sample from an individual with a condition, such as a disease, is 
compared to values representing control levels of expression for the same markers in the 
mathematical sample pool. Such a control may be a set of values stored on a computer. Such 
artificial or mathematical controls may be constructed for any condition of interest. 

5 In another embodiment, the reference sample is derived from a normal breast cell line 

or a breast cancer cell line. In a preferred embodiment, the reference sample comprises 
samples taken from individuals within a specific patient subset, e.g., "ER+, ER/AGE high" 
individuals, wherein each of said individuals has a good prognosis, or each of said individuals 
has a poor prognosis. Of course, where, for example, expressed proteins are used as markers, 

10 the proteins are obtained from the individual's sample, and the standard or control can be a 
pool of proteins from a number of normal individuals, or from a number of individuals 
having a particular state of a condition, such as a pool of samples from individuals having a 
particular prognosis of breast cancer. 

The comparison may be accomplished by any means known in the art. For example, 
1 5 expression levels of various markers may be assessed by separation of target polynucleotide 
molecules (e.g., RNA or cDNA) derived from the markers in agarose or polyacrylamide gels, 
followed by hybridization with marker-specific oligonucleotide probes. Alternatively, the 
comparison may be accomplished by the labeling of target polynucleotide molecules 
followed by separation on a sequencing gel. Polynucleotide samples are placed on the gel 
20 such that patient and control or standard polynucleotides are in adjacent lanes. Comparison 
of expression levels is accomplished visually or by means of densitometer. In a preferred 
embodiment, the expression of all markers is assessed simultaneously by hybridization to a 
microarray. In each approach, markers meeting certain criteria are identified as informative 
for the prognosis of breast cancer. 

25 In one embodiment, genes are first screened based on significant variation in 

expression in a set of breast cancer tumor samples as compared to a standard or control 
sample. Genes may be screened, for example, by determining whether they show significant 
variation in at least some samples among the set of samples. Genes that do not show 
significant variation in at least some samples in the set of samples are presumed not to be 

30 informative, and are discarded from further consideration. Genes showing significant 
variation in at least some samples in the sample set are retained as candidate informative 
genes. The degree of variation in expression of a gene may be estimated by determining a 
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difference or ratio of the expression of the gene in a sample and a control. The difference or 
ratio of expression may be further transformed, e.g., by a linear or log transformation. 
Selection of candidate markers may be made based upon either significant up- or down- 



5 significance (e.g., the p-value) of the variation in expression of the gene. Preferably, both 
selection criteria are used. Thus, in one embodiment of the present invention, genes showing 
both a more than two-fold change (increase or decrease) in expression as compared to a 
standard in at least three samples, and a p-value of variation in expression of the gene in the 
set of tumor samples as compared to the standard sample is no more than 0.01 (i.e., is 

10 statistically significant) are selected as candidate genes associated with prognosis of breast 
cancer in a patient subset. 

In the present invention, a "good prognosis" predicts non-occurrence of metastases 
within a predetermined period of time, e.g., 1, 2, 3, 4, 5 or more years, after initial diagnosis, 
whereas a "poor prognosis" predicts occurrence of metastasis within that period. In a specific 
15 embodiment, a "good prognosis" predicts non-occurrence of metastases within 5 years after 
initial diagnosis, and a "poor prognosis" predicts occurrence of metastasis within that period. 

Expression profiles comprising a plurality of different genes in a plurality of N breast 
cancer tumor samples can be used to identify markers that correlate with, and therefore are 
useful for discriminating, different clinical categories. In a specific embodiment, a 

20 correlation coefficient p between a vector c representing clinical categories or clinical 
parameters, e.g., a good or poor prognosis, in the N tumor samples and a vector 
r representing the measured expression levels of a gene in the N tumor samples is used as a 
measure of the correlation between the expression level of the gene and clinical category. 
The expression levels can be a measured abundance level of a transcript of the gene, or any 

25 transformation of the measured abundance, e.g., a logarithmic or a log ratio. Specifically, the 
correlation coefficient may be calculated as: 



Markers for which the coefficient of correlation exceeds a cutoff are identified as 
prognosis-informative markers specific for a particular clinical category, e.g., a good 
30 prognosis, within a given patient subset. Such a cutoff or threshold may correspond to a 
certain significance of the set of obtained discriminating genes. The threshold may also be 



regulation of the gene in at least some samples in the set or based on the statistical 



a) 
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selected based on the number of samples used. For example, a threshold can be calculated as 
3X1/ Vw-3 , where 1 / Vn-3 is the distribution width and n = the number of samples. In a 
specific embodiment, markers are chosen if the correlation coefficient is greater than about 
0.3 or less than about -0.3. 

5 Next, the significance of the set of marker genes can be evaluated. The significance 

may be calculated by any appropriate statistical method. In a specific example, a Monte- 
Carlo technique is used to randomize the association between the expression profiles of the 
plurality of patients and the clinical categories to generate a set of randomized data. The 
same marker selection procedure as used to select the marker set is applied to the randomized 

10 data to obtain a control marker set. A plurality of such runs can be performed to generate a 
probability distribution of the number of genes in control marker sets. In a preferred 
embodiment, 10,000 such runs are performed. From the probability distribution, the 
probability of finding a marker set consisting of a given number of markers when no 
correlation between the expression levels and phenotype is expected (i.e., based randomized 

15 data) can be determined. The significance of the marker set obtained from the real data can 
be evaluated based on the number of markers in the marker set by comparing to the 
. probability of obtaining a control marker set consisting of the same number of markers using 
the randomized data. In one embodiment, if the probability of obtaining a control marker set 
consisting of the same number of markers using the randomized data is below a given 

20 probability threshold, the marker set is said to be significant. 

Once a marker set is identified, the markers may be rank-ordered in order of 
correlation or significance of discrimination. One means of rank ordering is by the amplitude 
of correlation between the change in gene expression of the marker and the specific condition 
being discriminated. Another, preferred, means is to use a statistical metric. In a specific 
25 embodiment, the metric is a t-test-like statistic: 

, = «*»>-<*>>}/ , (2) 

/VKCh, -l) + <r 2 2 (« 2 -l)]/^ +„ 2 -1)/(1/ Wl +l/« 2 ) 

In this equation, (x { ) is the error-weighted average of the log ratio of transcript expression 
measurements within a first clinical group {e.g., good prognosis), (x 2 ) is the error-weighted 
average of log ratio within a second, related clinical group (e.g., poor prognosis), a t is the 
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variance of the log ratio within the first clinical group (e.g., good prognosis), H\ is the number 
of samples for which valid measurements of log ratios are available, tr 2 is the variance of log 
ratio within the second clinical group (e.g., poor prognosis), and « 2 is the number of samples 
for which valid measurements of log ratios are available. The f-value represents the variance- 
compensated difference between two means. The rank-ordered marker set may be used to 
optimize the number of markers in the set used for discrimination. 

A set of genes for prognosis of breast cancer can also be identified using an iterative 
approach. This is accomplished generally in a "leave one out" method as follows. In a first 
run, a subset, for example five, of the markers from the top of the ranked list is used to 
generate a template, where out of N samples, JV-1 are used to generate the template, and the 
status of the remaining sample is predicted. This process is repeated for every sample until 
every one of the N samples is predicted once. In a second run, one or more additional 
markers, for example five additional markers, are added, so that a template is now generated 
from 10 markers, and the outcome of the remaining sample is predicted. This process is 
repeated until the entire set of markers is used to generate the template. For each of the runs, 
type 1 error (false negative) and type 2 errors (false positive) are counted. The set of top- 
ranked markers that corresponds to lowest type 1 error rate, or type 2 error rate, or preferably 
the total of type 1 and type 2 error rate is selected. 

For prognostic markers, validation of the marker set may be accomplished by an 
additional statistic, a survival model. This statistic generates the probability of tumor distant 
metastases as a function of time since initial diagnosis. A number of models may be used, 
including Weibull, normal, log-normal, log logistic, log-exponential, or log-Rayleigh 
(Chapter 12 "Life Testing", S-PLUS 2000 Guide TO Statistics, Vol. 2, p. 368 (2000)). For 
the "normal" model, the probability of distant metastases P at time t is calculated as 



where a is fixed and equal to 1, and X is a parameter to be fitted and measures the "expected 
lifetime". 

It is preferable that the above marker identification process be iterated one or more 
times by excluding one or more samples from the marker selection or ranking (i.e., from the 




(3) 
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calculation of correlation). Those samples being excluded are the ones that can not be 
predicted correctly from the previous iteration. Preferably, those samples excluded from 
marker selection in this iteration process are included in the classifier performance 
evaluation, to avoid overstating the performance. 

5 5.1.3 CLASSIFIER GENESETS FOR PATIENT SUBSETS 

This section provides exemplary sets of markers useful for the prognosis of breast 
cancer. The markers were identified according to the above methods in specific subsets of 
individuals with breast cancer. Generally, the marker sets can be used for prognosis of breast 
cancer patients that fall into five phenotypic categories based on criteria relevant to breast 

10 cancer prognosis, including estrogen receptor (ER) status, lymph node status, type of 

mutation(s) (i.e., BRCAl-type or sporadic) and age at diagnosis. More specifically, patients, 
and tumors from which samples were taken, were classified as ER - , sporadic (i.e., being both 
estrogen receptor negative and having a non-BRCAl-type tumor); ER~ BRCA1 (i.e., being 
both estrogen receptor negative and having a BRCAl-type tumor); ER+, ER/AGE high (i.e., 

1 5 estrogen receptor positive with a high ratio of the log (ratio) of estrogen receptor gene 

expression to age); ER+, ER/AGE low, LN+ (i.e., estrogen receptor positive with a low ratio 
of the log (ratio) of estrogen receptor gene expression to age, lymph node positive); and ER + , 
ER/AGE low, LN _ (i.e., estrogen receptor positive with a low ratio of the log (ratio) of 
estrogen receptor gene expression to age, lymph node negative). The rationale for 

20 subdivision of the original patient set into these five subsets is detailed in the Examples 

(Section 6). The marker sets useful for each of the subsets above are provided in Tables 1-5, 
respectively. 



Table 1 : Geneset of 20 markers used to classify ER , sporadic individuals. 



Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp_xref_keyword_list 


AF055033 


IGFBP5 


-2.12 


0.88 


0.54 


insulin-like growth 
factor binding protein 
5 


Growth factor binding, 
Glycoprotein, Signal, 3D- 
structure 


NM_000599 


IGFBP5 


-3.41 


0.43 


0.53 


insulin-like growth 
factor binding protein 
5 


Growth factor binding, 
Glycoprotein, Signal, 3D- 
structure 


L27560 


IGFBP5 


-4.55 


0 


0.52 


EST 


Hypothetical protein 


AF052162 


FLJ 12443 


-0.27 


1.6 


0.52 


EST 


Hypothetical protein 


NMJD01456 


FLNA 


-0.61 


2.47 


0.52 


filamin A, alpha (actin 
binding protein 280) 


Hypothetical protein, 
Actin-binding, 
Phosphorylation, Repeat, 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp_xref_keyword_list 














Polymorphism, Disease 
mutation 


NM_002205 


TGA5 


-0.37 


2.08 


0.49 


ntegrin, alpha 5 
[fibronectin receptor, 
alpha polypeptide) 


ntegrin, Cell adhesion, 
Receptor, Glycoprotein, 
Transmembrane, Signal, 
Calcium, Repeat 


NM_013261 


PPARGC1 


0.09 


1.54 


0.47 


peroxisome proliferative activated receptor, 
qamma, coactivator 1 


NM_001605 


AARS 


0.39 


2.36 


0.51 


alanyl-tRNA 
synthetase 


Aminoacyl-tRNA 
synthetase, Protein 
biosynthesis, Ligase, 
ATP-binding 


X87949 


HSPA5 


-0.03 


2.03 


0.49 


heat shock 70kDa 
protein 5 (glucose- 
regulated protein, 
78kDa) 


ATP-binding, Hypothetical 
protein, Endoplasmic 
reticulum, Signal 


Contig50950_RC 


NGEF 


-1.17 


3.2 


0.52 


neuronal guanine nucleotide exchange factor 


NM_005689 


ABCB6 


-0.51 


2.26 


0.48 


ATP-binding cassette, 
sub-family B 
(MDR/TAP), member 
6 


ATP-binding, Transport, 
Transmembrane, 
Mitochondrion, Inner 
membrane, Transit 
peptide, Hypothetical 
protein 


NM_004577 


PSPH 


-0.56 


3.05 


0.51 


phosphoserine 
phosphatase 


Hydrolase, Serine 
biosynthesis, Magnesium, 
Phosphorylation 


NM_003832 


PSPHL 


-2.08 


2.18 


0.5 


phosphoserine 
phosphatase-like 




NM_002422 


MMP3 


-0.96 


2.54 


0.5 


matrix 

metalloproteinase 3 
(stromelysin 1 , 
progelatinase) 


Hydrolase, 
Metalloprotease, 
Glycoprotein, Zinc, 
Zymogen, Calcium, 
Collagen degradation, 
Extracellular matrix, 
Signal, Polymorphism, 
3D-structure 


Contig37562_RC 




-3.42 


-6.02 


-0.59 


ESTs 




NM_0 18465 


MDS030 


-0.82 


-3.28 


-0.58 


uncharacterized 
hematopoietic 
stem/progenitor cells 
protein MDS030 


Hypothetical protein 


Contig54661 RC 




-0.7S 


-2.08 


-0.54 


ESTs 




AB032969 


KIAA1143 


-o.e 


-2.85 


-0.53 


KIAA1143 protein 


Hypothetical protein 


Contig55353_RC 


KIAA1915 


-0.27 


-1.82 


-0.47 


KIAA1 91 5 protein 


Hypothetical protein 


NM_005213 


CSTA 


2.11 


-3.4| -0.4S 


cystatin A (stefin A) 


Thiol protease inhibitor, 
3D-structure 



Table 2. Geneset of 10 markers used to classify ER , BRCA1 individuals. 



Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Sequence 
name 


Description 


Sp xref keyword li 

St 


AF005487 




6.08 


0.5 


-0.79 


HLA-DRB6 


Homo sapiens 
MHC class II 
antigen (DRB6) 


MHC 
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mRNA, HLA- 
DRB6*0201 allele, 
sequence. 




Contig50728_RC 
Contig53598_RC 




4.02 


0.25 


-0.77 




ESTs, Weakly similar to S26650 DNA- 
binding protein 5 - human [H.sapiens] 


8.41 


3.26 


-0.77 


FLJ11413 


hypothetical protein 
FLJ11413 


Hypothetical protein 


NM_002888 


RARR 
ES1 


6.9 


0.05 


-0.87 


RARRES1 


retinoic acid 
receptor responder 
(tazarotene 
induced) 1 


Receptor, 

Transmembrane, 

Signal-anchor 














defensin, beta 1 


Antibiotic, Signal, 3D- 
structure 


U17077 


BENE 


2.72 


-1.72 


-0.77 


BENE 


BENE protein 


Transmembrane 


Contig14683_RC 




1.29 


-2.31 


-0.74 




ESTs 




Contig53641_RC 




-3.29 


4.23 


0.75 


MAGE-E1 


MAGE-E1 protein 


Hypothetical protein 


Contig56678_RC 




-6.7 


-9.73 


-0.82 




ESTs, Highly similar to THYA_HUMAN 
Prothymosin alpha [H.sapiens] 


NM_005461 


KRML 


0.88 


-3.38 


-0.75 


MAFB 


v-maf 

musculoaponeuroti 
c fibrosarcoma 
oncogene homolog 
B (avian) 


Transcription 
regulation, Repressor, 
DNA-binding, Nuclear 
protein, Hypothetical 
protein 



Table 3. Geneset of 50 markers used to classify ER+, ER/AGE high individuals. 



Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp_xref_keyword_llst 


NM_003600 


STK15 


-2.93 


2.08 


0.8 


serine/threonine 
kinase 6 


ATP-binding, Kinase, 
Serine/threonine-protein 
kinase, Transferase 


NM_003158 


STK6 


-1.57 


1.42 


0.78 


serine/threonine 
kinase 6 


ATP-binding, Kinase, 
Serine/threonine-protein 
kinase, Transferase 


NM_007019 


UBCH10 


-2.98 


2.62 


0.81 


ubiquitin-conjugating 
enzyme E2C 


Hypothetical protein, Ubl 
conjugation pathway, 
Ligase, Multigene family, 
Mitosis, Cell cycle, Cell 
division 


NM_013277 


ID-GAP 


-2.43 


2.43 


0.77 


Rac GTPase 
activating protein 1 


Hypothetical protein 


NM_004336 


BUB1 


-2.04 


1.39 


0.77 


BUB1 budding 
uninhibited by 
benzimidazoles 1 
homolog (yeast) 


Transferase, 
Serine/threonine-protein 
kinase, ATP-binding, Cell 
cycle, Nuclear protein, 
Mitosis, Phosphorylation, 
Polymorphism 


NM_006607 


PTTG2 


-1.71 


1.49 


0.72 


pituitary tumor- 
transforming 2 




AK001166 


FLJ 11252 


-1.33 


0.99 


0.71 


hypothetical protein 
FLJ 11252 


Hypothetical protein 


NM_004701 


CCNB2 


-4.62 


2.01 


0.81 


cyclin B2 


Cyclin, Cell cycle, Cell 
division, Mitosis 


Contig57584_RC 




-3.68 


2.04 


0.78 


likely ortholog of mouse gene rich cluster, C8 
gene 


NM 006845 


KNSL6 


-4.13 


1.05 


0.73 


kinesin-like 6 (mitotic|Hypothetical protein, 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp xref keyword list 












centromere- 
associated kinesin) 


Motor protein, 
Microtubules, ATP- 
binding, Coiled coil, 
Nuclear protein 


Contig38901_RC 




-3.08 


1.15 


0.75 


hypothetical protein 
MGC45866 


Hypothetical protein 


NM_018410 


DKFZp76 
2E1312 


-4.38 


1.49 


0.75 


hypothetical protein 
DKFZp762E1312 


Hypothetical protein 


NM_003981 


PRC1 


-3.52 


2.17 


0.78 


protein regulator of 
cytokinesis 1 




NM_001809 


CENPA 


-5.04 


0.98 


0.75 


centromere protein 
A, 17kDa 


Hypothetical protein 
Chromosomal protein, 
Nuclear protein, DNA- 
binding, Centromere, 
Antigen 


NM_003504 


CDC45L 


-2.67 


1.22 


0.73 


CDC45 cell division 
cycle 45-like (S. 
cerevisiae) 


DNA replication, Cell 
cycle, Nuclear protein, 
Cell division 


Contig41413_RC 




-5.43 


2.15 


0.74 


ribonucleotide 
reductase M2 
polypeptide 


Oxidoreductase, DNA 
replication, Iron 


NM_004217 


STK12 


-2.17 


0.73 


0.72 


serine/threonine 
kinase 12 


Hypothetical protein, ATP- 
binding, Kinase, 
Serine/threonine-protein 
kinase, Transferase 


NM_002358 


MAD2L1 


-2.65 


2.27 


0.83 


MAD2 mitotic arrest 
deficient-like 1 
(yeast) 


Cell cycle Mitosis 
Nuclear protein, 3D- 
structure 


NM_014321 


ORC6L 


-2.73 


1.8 


0.75 


origin recognition 
complex, subunit 6 
homolog-like (yeast) 


Hypothetical protein, DNA 
replication, Nuclear 
protein, DNA-binding 


NM_012291 


KIAA0165 


-1.52 


1.55 


0.71 


extra spindle poles 
like 1 (S. cerevisiae) 


Hypothetical protein 


NM_004203 


PKMYT1 


-3.64 


2.2 


0.7 


rstinoblsstoms-liks 2 
( P 130) 


ATP-binding, Kinase, 
Serine/threonine-protein 
kinase, Transferase, 
Transcription regulation, 
DNA-binding, Nuclear 
protein, Cell cycle, 
Phosphorylation, Anti- 
oncogene 


M96577 


E2F1 


-2.14 


1.42 


0.75 


E2F transcription 
factor 1 


Transcription regulation, 
Activator, DNA-binding, 

Phosphorylation, Cell 
cycle, Apoptosis, 
Polymorphism 


NM_002266 


KPNA2 


-3.77 


1.78 


0.71 


karyopherin alpha 2 
(RAG cohort 1 , 
importin alpha 1) 


Transport, Protein 
transport, Repeat, 
Nuclear protein, 
Polymorphism 


Contig31288_RC 




-2.63 


0.7 


0.68 


ESTs, Weakly similar 
FLJ20489 [Homo sap 


to hypothetical protein 
iens] [H.sapiens] 


NM_014501 


E2-EPF 


-1.55 


1.93 


0.7 


ubiquitin carrier 
protein 


Ubl conjugation pathway, 
Ligase, Multigene family 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp_xref_keyword_list 


NM_001168 


BIRC5 


-5.76 


2.01 


0.78 


baculoviral IAP 
repeat-containing 5 
(survivin) 


Apoptosis, Thiol protease 
inhibitor, Alternative 
splicing, 3D-structure, 
Hypothetical protein, 
Protease, Receptor 


NM_003258 


TK1 


-4.57 


1.38 


0.71 


thymidine kinase 1 , 
soluble 


Transferase, Kinase, DNA 
synthesis, ATP-binding 


NM_001254 


CDC6 


-2.46 


0.28 


0.72 


CDC6 cell division 
cycle 6 homolog (S. 
cerevisiae) 


ATP-binding, Cell division 


NM_004900 


DJ742C19 

.2 


-2.96 


0.13 


0.69 


apolipoprotein B 
mRNA editing 
enzyme, catalytic 
polypeptide-like 3B 


Hydrolase 


NM_004702 


CCNE2 


-3.12 


2.13 


0.81 


cyclin E2 


Cell cycle, Cell division, 
Cyclin, Hypothetical 
protein, Phosphorylation, 
Alternative splicing, 
Nuclear protein 


AL160131 




-3.07 


2.42 


0.7 


hypothetical protein 
MGC861 


Hypothetical protein 


NM_016359 


LOC5120 
3 


-3.22 


2.61 


0.76 


nucleolar protein 
ANKT 


Hypothetical protein, 
Nuclear protein 


NM_004856 


KNSL5 


-1.52 


1.1 


0.71 


kinesin-like 5 (mitotic 
kinesin-like protein 
1) 


Motor protein, Cell 
division, Microtubules, 
ATP-binding, Coiled coil, 
Mitosis, Cell cycle, 
Nuclear protein 


NM_000057 


BLM 


-1.54 


0.76 


0.71 


Bloom syndrome 


Hydrolase, Helicase, 
ATP-binding, DNA- ' 
binding, Nuclear protein, 
DNA replication, Disease 
mutation 


NM_018455 


BM039 


-2.44 


1.18 


0.7 


uncharacterized 
bone marrow protein 
BM039 




NM_002106 


H2AFZ 


-2.49 


1.53 


0.72 


H2A histone family, 
member Z 


Chromosomal protein, 
Nucleosome core, 
Nuclear protein, DNA- 
binding, Multigene family 


Contig64688 




-2.68 


3.1 


0.73 


hypothetical protein 
FLJ23468 


Hypothetical protein 


Contig44289 RC 




-1.65 


1.6 


0.67 


ESTs 




Contig28552_RC 




-1.37 


1.53 


0.68 


diaphanous homolog 
3 (Drosophila) 


Hypothetical protein, 
Coiled coil, Repeat, 
Alternative splicina 


Contig46218_RC 




-1.31 


1.56 


0.68 


ESTs, Weakly similar to T19201 hypothetical 
protein C11G6.3 - Caenorhabditis elegans [C. 
elegans] 


Contig28947_RC 




-1.3 


0.98 


0.67 


cell division cycle 
25A 


Hypothetical protein, Cell 
division, Mitosis, 
Hydrolase, Alternative 
splicing, Multigene family, 
3D-structure 


NM_016095 


LOC5165 


-1.4 


2.13 


0.67 


HSPC037 protein 


Hypothetical protein 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp_xref_keyword_!ist 




9 












NMJJ03090 


SNRPA1 


-3.26 


0.95 


0.7 


small nuclear ; 
ribonucleoprotein 
polypeptide A' 


Hypothetical protein, 
Nuclear protein, RNA- 
binding, 

Ribonucleoprotein, 
Leucine-rich repeat, 
Repeat, 3D-structure 












proteasome 
(prosome, 
macropain) 26S 
subunit, non- 
ATPase, 7 (Mov34 
homolog) 


Proteasome 












hypothetical protein 
DKFZp762A2013 


Hypothetical protein 


NM_003406 


YWHAZ 


-1.5 


2.79 


0.68 


tyrosine 3- 
monooxygenase/tryp 
tophan 5- 
monooxygenase 
activation protein, 
zeta polypeptide 


Brain, Neurone, 
Phosphorylation, 
Acetylation, Multigene 
family, 3D-structure 


AM- I 0 / U**U 










netrin 4 


Hypothetical protein, 
Laminin EGF-like domain, 
Signal 


AL049367 




1.9 


-3.2 


-0.68 


EST 


Transducer, Prenylation, 
Lipoprotein, Multigene 
family, Acetylation 


NM.013409 


FST 


1.04 


-5.78 


-0.69 


follistatin 


Glycoprotein, Repeat, 
Signal, Alternative splicing 


NM_000060 


BTD 


3.1 


-1.45 


-0.67 


biotinidase 


Hydrolase, Glycoprotein, 
Signal, Disease mutation 



Table 4. Geneset of 50 markers used to classify ER+, ER/AGE low, LN+ individuals. 



Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Description 


Sp_xref_keyword_list 


NM_006417 


MTAP44 


-1.5 


3 


0.69 


Fc fragment of IgG, 
low affinity lib, 
receptor for (CD32) 


Hydrolase, Hypothetical 
protein, Immunoglobulin 
domain, IgG-binding 
protein, Receptor, 
Transmembrane, 
Glycoprotein, Signal, 
Repeat, Multigene family, 
Polymorphism, NAD, 
One-carbon metabolism, 
Serine protease, 
Zymogen, Protease, 
Alternative splicing, 
Chromosomal 
translocation, Proto- 
oncogene, Galaptin, 
Lectin, Antigen 


NM_006820 


GS3686 


-4.3 


4.06 


0.69 


chromosome 1 open 
reading frame 29 


Hypothetical protein 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Description 


Sp_xref_keyword_list 


NM.001548 


FIT1 


-3.4 


4.27 


0.71 


nterferon-induced 
Drotein with 
etratricopeptide 
epeats 1 


Repeat, TPR repeat, 
nterferon induction 


Contig41538_RC 




-2.5 


3.16 


0.68 


=STs, Moderately similar to hypothetical 
irotein FLJ20489 [Homo sapiens] 


NM.016816 


DAS1 


-1.7 


3.29 


0.75 


2',5'-oligoadenylate 
synthetase 1, 
40/46kDa 


RNA-binding, 
Transferase, 
Nucleotidyltransferase, 
nterferon induction, 
Alternative splicing 


Contig51660_RC 




-2.1 


2.65 


0.66 


28kD interferon 
responsive protein 


Transmembrane 


Contig43645_RC 




-4.8 


1.44 


0.63 


Homo sapiens, 
clone 

MAGE:4428577, 
mRNA, partial cds 


Hypothetical protein 


AF026941 




-4.6 


2.71 


0.63 


EST, Weakly similar 
to 2004399A 
chromosomal 
protein [Homo 
sapiens] 


Hypothetical protein 


IMM_UU/o1C3 










signal transducer 
and activator of 
transcription 1 , 
91kDa 


Transcription regulation, 
DNA-binding, Nuclear 
protein, Phosphorylation, 
SH2 domain, Alternative 
splicing, 3D-structure 


NM_002038 


G1P3 


-4.1 


5.64 


0.79 


interferon, alpha- 
inducible protein 
(clone IFI-6-16) 


Interferon induction, 
Transmembrane, Signal, 
Alternative splicing 


NM_005101 










interferon-stimulated 
protein, 15 kDa 


Interferon induction, 
Repeat 


NM_002462 


MX1 


-6.1 


0.83 


0.56 


myxovirus (influenza 
virus) resistance 1, 
interferon-inducible 
protein p78 (mouse) 


Hypothetical protein, 
Interferon induction, GTP- 
binding, Multigene family, 
Antiviral 


NMJ305532 


IFI27 


-5.8 


2.81 


0.59 


interferon, alpha- 
inducible protein 27 


Interferon induction, 
Transmembrane 


NM_002346 


LY6E 


-2.1 


3.58 


0.75 


lymphocyte antigen 
6 complex, locus E 


Signal, Antigen, Multigene 
family, Membrane, GPI- 
anchor 


MM AHCQ47 

Nm_UlOO i / 










2'-5'-oligoadenylate 
synthetase 2, 
69/71 kDa 


RNA-binding, 
Transferase, 
Nucleotidyltransferase, 
Repeat, Interferon 
induction, Alternative 
splicing, Myristate 


Contig44909_RC 




-2.: 


j 1.1: 


\ 0.5 


) hypothetical protein 
BC012330 


Hypothetical protein 


NM_017414 


USP18 


-4. 


3.3" 


r 0.7: 


I ubiquitin specific 
protease 18 


Ubl conjugation pathway, 
Hydrolase, Thiol protease, 
Multigene family 


NM_004029 


IRF7 


-2. 


% 3.6 


1 0.6 


5 interferon regulatory 
factor 7 


Collagen, Transcription 
regulation, DNA-binding, 
Nuclear protein, Activator, 
Alternative splicing 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Description 


Sp_xref_keyword_list 


NM_004335 


BST2 


-3.2 


3.22 


0.57 


bone marrow 
stromal cell antigen 
2 


Transmembrane, 
Glycoprotein Signal- 
anchor, Polymorphism 


NM_002759 


PRKR 


-2.4 


1.8 


0.58 


protein kinase, 
interferon-inducible 
double stranded 
RNA dependent 


Transferase, 
Serine/threonine-protein 
kinase, ATP-binding, 

Interferon induction, RNA- 
binding, 3D-structure 


NM_006332 


IFI30 


-3.8 


2.65 


0.64 


interferon, gamma- 
inducible protein 30 


Oxidor6ductdS6 
Interferon induction, 
Glycoprotein, Lysosome, 
Signal, Hypothetical 
protein 


NM_009587 


LGALS9 


-3.2 


2.08 


0.6 


lectin, galactoside- 
binding, soluble, 9 
(galectin 9) 


Galaptin, Lectin Repeat 
Alternative splicing 


NM_003641 


IFITM1 


-2.4 


5.54 


0.63 


interferon induced 
transmembrane 
protein 1 (9-27) 


Interferon induction 
Transmembrane 


NM_017523 


HSXIAPA 
F1 


-1 


2.84 


0.7 


XIAP associated 
factor-1 


Hypothetical protein 


NM_014314 


RIG-I 


-1.3 


3.55 


0.62 


RNA helicase 


ATP-binding, Helicase, 
Hydrolase, Hypothetical 
protein 


Contig47563 RC 




-2.2 


3.11 


0.56 


ESTs 




AI497657_RC 




-4.4 


5.61 


0.74 


guanine nucleotide 
binding protein 4 


Transducer, Prenylation, 
Lipoprotein, Multigene 


NM_000735 


CGA 


-4.3 


2.5 


0.58 


glycoprotein 
hormones, alpha 
polypeptide 


Hormone, Glycoprotein, 
Signal, 3D-structure 


NM_004988 


MAGEA1 


-1.4 


6.31 


0.64 


melanoma antigen, 
family A, 1 (directs' 
expression of 
antigen MZ2-E) 


Antigen Multigene family 
Polymorphism, Tumor 
antigen 


Contig54242_RC 




-1.2 


4.1 


0.65 


chromosome 17 
open reading frame 
26 


Hypothetical protein 


NM_004710 


SYNGR2 


-1.4 


3.01 


0.54 


synaptogyrin 2 


transmembrane j 


NM_001168 


BIRC5 


-3.7 


3.39 




baculoviral IAP 
repeat-containing 5 
(survivin) 


Hypothetical protein, 
Protease, Receptor, 
Apoptosis, Thiol protease 
inhibitor, Alternative 
splicing, 3D-structure 


Contig41413_RC 




-4.4 


2.61 


0.57 


ribonucleotide 
reductase M2 
polypeptide 


Oxidoreductase, DNA 
replication, Iron 


NM_004203 


PKMYT1 


-3.4 


3.79 


0.6 


retinoblastoma-like 2 
(p130) 


ATP-binding, Kinase, 
Serine/threonine-protein 
kinase, Transferase, 
Transcription regulation, 
DNA-binding, Nuclear 
protein, Cell cycle, 
Phosphorylation, Anti- 
oncogene 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Description 


Sp_xref_keywordJist 


Contig48913_RC 




-3.1 


1.72 


0.55/ 


-iomo sapiens, Similar to hypothetical protein 
3R01722, clone MGC:15692 IMAGE:3351479, 
nRNA, complete cds 


NM_005804 


DDXL 


-2.5 


1.42 


0.58 


DEAD/H (Asp-Glu- 
Ma-Asp/His) box 
oolypeptide 39 


ATP-binding, Helicase, 
Hydrolase, Hypothetical 
orotein 


NM_016359 


-OC5120 
3 


-1.7 


3.6 


0.57 


nucleolar protein 
ANKT 


Hypothetical protein, 
Nuclear protein 


NM_001645 


*POC1 


-2.9 


3.43 


0.58 


ipolipoprotsin C-l 


3 lasma Lipid transport 
VLDL, Signal, 3D- . 
structure, Polymorphism 


Contig37895_RC 




-2 


2.05 


0.55 


ESTs 




NM_005749 


TOB1 


-1.3 


4.96 


0.59 


transducer of 
ERBB2, 1 


Phosphorylation 


NM 000269 


VJME1 


-1.3 


2.98 


0.55 


non-metastatic cells 
i, proiein ^iniviziSm; 
expressed in 


Transferase, Kinase, 
ATP-binding, Nuclear 
arotein, Anti-oncogene, 
Disease mutation 


NM_014462 


LSM1 


-1 


4.5 


0.57 




Nuclear protein, 
Ribonucleoprotein, mRNA 
splicing, mRNA 
processing RNA-binding 


Contig31221_RC 




-1.4 


3.83 


0.56 


HTPAP protein 




NM_005326 


HAGH 


-1.9 


4.29 


0.57 


hydroxyacyl 
glutathione 
hydrolase 


Hydrolase, Zinc, 3D- 
structure 


Contig42342_RC 




0.78 


-3.2 


-0.6 


Homo sapiens cDNA 
FLJ39417fis, clone 
PLACE6016942 


Hypothetical protein 


AL1 37540 


NTN4 


2.24 


-3.9 


-0.6 


netrin 4 


Laminin EGF-like domain, 
Signal, Hypothetical 
protein 


Contig40434_RC 




1.64 


-5.6 


-0.6 


wingless-type MMTV 
integration site 
family, member 5A 


Developmental protein, 
Glycoprotein, Signal 


Contig 1632_RC 




1.03 


-3.9 


-0.6 


hypothetical protein 
MGC17921 


Hypothetical protein 


NM_014246 


CELSR1 


0.95 


-4.6 


-0.6 


cadherin, EGF LAG 
seven-pass G-type 
receptor 1 (flamingo 
homolog, 
Drosophila) 


G-protein coupled 
receptor, 
Transmembrane, 
oiycoproiein, cur-iiKe 
domain, Calcium-binding, 
Laminin EGF-like domain, 
Repeat, Developmental 
protein, Hydroxylation, 
Signal, Alternative 
splicing, Hypothetical 
protein 


NM_005139 


ANXA3 


1.2C 


5 -6.: 


> -o.c 


5 annexin A3 


Annexin, 

Calcium/phospholipid- 
binding, Repeat, 
Phospholipase A2 
inhibitor, 3D-structure, 
Polymorphism 



-33- 




WO 2006/084272 PCT/US2006/004280 



Table 5. Geneset of 65 markers used to classify ER+, ER/AGE low, LIST individuals. 



Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Sorrel- 
ation 


Sequence 
name 


Description 


Sp xref keywordji 

St 


M55914 


VIPB1 


-2.82 


1.25 


0.5 


EN01 ( 


snolase 1, (alpha) 


DNA-binding, 
rranscription 
'egulation, Repressor, 
Muclear protein, 
Lyase, Glycolysis, 
Magnesium, 
vlultigene family, 
Hypothetical protein 


NM_005945 


MPB1 


-3.06 


1.19 


0.49 


ENOI 


Homo sapiens 
enolase 1 , (alpha) 
EN01), mRNA. 


Glycolysis, 
Hypothetical protein, 
Lyase, Magnesium, 
DNA-binding, 
Transcription 
regulation, Repressor, 
Nuclear protein, 
Multigene family 


NM.001428 


EN01 


-2.53 


1.18 


0.46 


EN01 


enolase 1, (alpha) 


DNA-binding, 
Transcription 
regulation, Repressor, 
Nuclear protein, 
Lyase, Glycolysis, 
Magnesium, 
Multigene family, 
Hypothetical protein 


NM_001216 


CA9 


-4.72 


1.49 


0.6 


CA9 


carbonic 
anhydrase IX 


Lyase, Zinc, 
Transmembrane, 
Glycoprotein, Antigen, 
Signal, Nuclear 
protein, 
Polymorphism 


NM_001124 


ADM 


-5.68 


2.99 


0.56 


ADM 


adrenomedullin 


Hormone, Amidation, 
Cleavage on pair of 
basic residues, Signal 


NM_000584 


IL8 


-2.45 


2.04 


0.54 


IL8 


interleukin 8 


Cytokine, 
Chemotaxis, 
Inflammatory 
response, Signal, 
Alternative splicing, 
3D-structure 


D25328 


PFKP 


-4.1S 


3.2S 


0.56 


PFKP 


Phosphofructo- 
kinase, platelet 


Kinase, Transferase, 
Glycolysis, Repeat, 
Allosteric enzyme, 
Phosphorylation, 
Magnesium, 
Multigene family 


NM_006096 


NDRG1 


-5.4e 


5.9" 


0.7" 


NDRG1 


N-myc downstream 
regulated gene 1 


Hypothetical protein, 
Nuclear protein, 
Repeat 


NM_004994 


MMP9 


-5.5. 


J 1.0- 


' 0.45 


)MMP9 


matrix 

metalloproteinase 
9 (gelatinase B, 
92kDa gelatinase, 
92kDa type IV 
collagenase) 


Hydrolase, 
Metalloprotease, 
Glycoprotein, Zinc, 
Zymogen, Calcium, 
Collagen degradation, 
Extracellular matrix, 
Repeat, Signal, 
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Accession/ 
Contig No. 


Gene 


Avg 
jood 
tcdev 


Avg ( 

poor 

icdev 


Jorrel- 
ation 


Sequence 
name 


Description 


Sp xref keywordji 

St 
















3 olymorphism, 3D- 
tructure 


NMJ303311 1 


rssc3 


-4.57 


5.58 


0.68- 


rssc3 t 


umor suppressing subtransferable 
;andidate 3 


NM_006086 


rUBB4 


-5.19 


2.85 


0.59' 


rUBB4 


ubulin, beta, 4 


3-protein coupled 
eceptor, 

Transmembrane, 

Glycoprotein, 

3 hosphorylation, 

Lipoprotein, 

Palmitate, 

'olymorphism, 

Hypothetical protein, 

GTP-binding, 

Receptor, 

Microtubules, 

Multigene family 


NM_006115 


PRAME 


-4.48 


2.77 


0.61 


PRAME 


preferentially 
expressed antigen 
in melanoma 


Antigen 


NM_004345 


CAMP 


-2.02 


1.37 


0.49 


CAMP 


cathelicidin 
antimicrobial 
peptide . 


Antibiotic, Signal 


NM.018455 


BM039 


-2.34 


0.76 


0.47 


BM039 


uncharacterized 
bone marrow 
protein BM039 




Contig49169_RC 




-1.17 


1.5 


0.46 


SUV39H2 


suppressor of 
variegation 3-9 
(Drosophila) 
homolog 2; 
hypothetical protein 
FLJ23414 


Hypothetical protein, 
Nuclear protein 


Contig45032_RC 




-1.37 


0.77 


0.45 


FLJ14813 


hypothetical protein 
FLJ14813 


Hypothetical protein, 
ATP-binding, Kinase, 
Serine/threonine- 
protein kinase, 
Transferase 


NM_000917 


P4HA1 


-1.54 


4.31 


0.62 


P4HA1 


procollagen- 
proline, 2- 
oxoglutarate 4- 
dioxygenase 
(proline 4- 
hydroxylase), alpha 
polypeptide I 


Dioxygenase, 
Collagen, 

Oxidoreductase, Iron, 
Vitamin C, Alternative 
splicing, Glycoprotein, 
Endoplasmic 
reticulum, Signal 


NM_002046 


GAPD 


-2.51 


3.4; 


o.e 


GAPD 


glyceraldehyde-3- 

phosphate 

dehydrogenase 


Glycolysis, NAD, 
Oxidoreductase, 
Hypothetical protein, 
Multigene family 


NM_000365 


TPI1 


-1.8 


2.9' 


\ 0.5( 


JTPI1 


triosephosphate 
isomerase 1 


Fatty acid 

biosynthesis, 

Gluconeogenesis, 

Glycolysis, 

Isomerase, Pentose 

shunt, Disease 

mutation, 

Polymorphism, 3D- 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Sequence 
name 


Description 


Sp_xref keyword li 

St 
















structure 


NM_014364 


GAPDS 


-1.08 


2.88 


0.58 


GAPDS 


glyceraldehyde-3- 
shosphate 
dehydrogenase, 
testis-specific 


Glycolysis, 

Dxidoreductase NAD 


NM_005566 


LDHA 


-2.01 


4.01 


0.59 


LDHA 


actate 

dehydrogenase A 


Oxidoreductase, 
MAD, Glycolysis, 
Multigene family, 
Disease mutation, 
Polymorphism 


NM 000291 












Dhosphoglycerate 
dnase 1 


Kinase, Transferase, 
Multigene family, 
Glycolysis, 
Acetylation, Disease 
mutation, 
Polymorphism, 
Hereditary hemolytic 
anemia 


NM_016185 


LOC511 
55 


-2.33 


2.82 


0.59 


HN1 


hematological and 
neurological 
expressed 1 




NM_001168 


BIRC5 


-4.33 


2.78 


0.55 


BIRC5 


baculoviral IAP 
repeat-containing 5 


Apoptosis, Thiol 
protease inhibitor, 
Alternative splicing, 
3D-structure, 
Hypothetical protein, 
Protease, Receptor 


NM_002266 


KPNA2 


-3.75 


1.34 


0.47 


KPNA2 


karyopherin alpha 
2 (RAG cohort 1, 
importin alpha 1) 


Transport, Protein 
transport, Repeat, 
Nuclear protein, 
Polymorphism 














ESTs, Weakly similar to hypothetical 
protein FLJ20489 [Homo sapiens] 
[H. sapiens] 


NM_000269 


NME1 


-2.15 


3.43 


0.55 


NME1 


non-metastatic 
cells 1 , protein 
(NM23A) 
expressed in 


Transferase, Kinase, 
ATP-binding, Nuclear 
protein, Anti- 
oncogene, Disease 
mutation I 


NM_003158 


STK6 


-1.23 


1.73 


0.45 


STK6 


serine/threonine 
Kinase 6 


ATP-binding, Kinase, 
Serine/threonine- 
protein kinase, 
Transferase 


NM_007274 


HBACH 


-1.83 


2.73 


0.51 


BACH 


brain acyl-CoA 
hydrolase 


Hydrolase, Serine 
esterase, Repeat 


Contig55188_RC 




-2.36 


3.28 


0.47 


FLJ22341 


hypothetical protein 
FLJ22341 


Hypothetical protein 


NM_002061 


GCLM 


-1.06 


1.76 


0.48 


GCLM 


glutamate-cysteine 
ligase, modifier 
subunit 


Ligase, Glutathione 
biosynthesis 


NM_004207 


SLC16A 
3 


-3.11 


5.07 


0.67 


SLC16A3 


solute carrier family 
16 (monocarboxylic 
acid transporters), 
member 3 


Transport, Symport, 
Transmembrane, 
Multigene family 


NM_000582 


SPP1 


-5.09 


5.47 


0.53 


SPP1 


secreted 

phosphoprotein 1 


Hypothetical protein, 
Glycoprotein, Sialic 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


-orrel- 
ation 


Sequence 
name 


Description 


Sp xref keyword II 

St 














osteopontin, bone 
sialoprotein I, early 
r-lymphocyte 
activation 1) 


acid, 

Biomineralization, Cell 
adhesion, 
Phosphorylation, 
Signal, Alternative 
splicing 


NM_001109 


ADAM8 


-2.5 


3.74 


0.45 


ADAM8 


a disintegrin and 
metalloproteinase 
domain 8 


Hydrolase, 

\Zletalloprotease, Zinc, 
Signal, Glycoprotein, 
Transmembrane, 
Antigen 


D50402 


SLC11A 
1 


-1.05 


3.46 


0.53 


SLC11A1 


solute carrier family 
11 (proton-coupled 
divalent metal ion 
transporters), 
member 1 


Transport, Iron 

transport, 

Transmembrane, 

Glycoprotein, 

Macrophage, 

Polymorphism 


AL080235 


DKFZP5 
86E162 
1 


-1.23 


1.96 


0.51 


RIS1 


Ras-induced 
senescence 1 


Hypothetical protein 


Contig40552_RC 




-1.26 


3.96 


0.54 


FLJ25348 


hypothetical protein 
FLJ25348 


Hypothetical protein 


Contig52490_RC 




-0.64 


3.33 


0.61 


LOC11623 
8 


hypothetical protein 
BC014072 




NM_006461 


DEEPE 
ST 


-2.1 


1.85 


0.46 


SPAG5 


sperm associated 
antigen 5 


Hypothetical protein 


Contig56503_RC 




-4.3 


3.39 


0.55 


MGC9753 


hypothetical gene 
MGC9753 


Hypothetical protein 


Contig63525 




-1.91 


3.34 


0.5 


FLJ13352 


hypothetical protein 
FLJ 13352 


Hypothetical protein 


NM_001909 


CTSD 


-0.83 


4.6 


0.51 


CTSD 


cathepsin D 
(lysosomal aspartyl 
protease) 


Hydrolase, Aspartyl 
protease, 
Glycoprotein, 
Lysosome, Signal, 
Zymogen, 
Polymorphism, 
Alzheimer's disease, 
3D-structure 


NM_005063 


SCD 


-2.57 


5.15 


0.48 


SCD 


stearoyl-CoA 
desaturase (delta- 
9-desaturase) 


Hypothetical protein, 
Endoplasmic 
reticulum, Fatty acid 
biosynthesis, Iron, 
Oxidoreductase, 
Transmembrane 


NM_005165 


ALDOC 


-2.43 


5.02 


0.48 


ALDOC 


aldolase C, 

fructose- 

bisphosphate 


Lyase, Schiff base, 
Glycolysis, Multigene 
family 


NM_000363 


TNNI3 


-0.5* 


3.5E 


0.46 


TNNI3 


troponin I, cardiac 


Hypothetical protein, 
Muscle protein, Actin- 
binding, Acetylation, 
Disease mutation, 
Cardiomyopathy, 
Receptor, Signal 


AF035284 




-1.6. 


» 3.2* 


i 0.4- 


FADS1 


EST 


Heme, Hypothetical 
protein 


Contig30875_RC 




-0.8 


3 


J O.C 




ESTs 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Sequence 
name 


Description 


Sp_xref_keyword li 

St 


NM_018487 


HCA112 


-0.7 


3.54 


0.58 


HCA112 


hepatocellular 
carcinoma- 
associated antigen 
112 


Hypothetical protein 


NM_001323 


CST6 


-1.63 


3.84 


0.57 


CST6 


cystatin E/M 


Thiol protease 
inhibitor, Signal 
Glycoprotein 


NM_006516 


SLC2A1 


-1.66 


2.22 


0.46 


SLC2A1 


solute carrier family 

0 ffarilitateri 

glucose 
transporter), 
member 1 


Transmembrane, 
Sugar transport, 
Transport, 
Glycoprotein 
Multigene family, 
Disease mutation 


NM_007267 


LAK-4P 


-1.04 


3.28 


0.61 


EVIN1 


expressed in 
activated T/LAK 
lymphocytes 


Hypothetical protein 


NM_004710 


SYNGR 
2 


-0.84 


4.81 


0.56 


SYNGR2 


synaptogyrin 2 


Transmembrane 


Contig63649_RC 




-1.34 


6.3 


0.75 




ESTs, Weakly similar to 2004399A 
chromosomal protein [Homo sapiens] 
[H. sapiens] 


NM 003376 


VEGF 


-2.12 


2.42 






vascular 

endothelial growth 
factor 


Hypothetical protein, 
Mitogen, 

Angiogenesis, Growth 
factor, Glycoprotein, 
Signal, Heparin- 
binding, Alternative 
splicing, Multigene 
family, 3D-structure 


NM 000799 


EPO 


-0.75 








erythropoietin 


Erythrocyte 
maturation, 
Glycoprotein, 
Hormone, Signal, 
Pharmaceutical, 3D- 
structure 


NM_006014 


DXS987 
9E 


-1.85 


3.44 


0.54 


DXS9879E 


DNA segment on cr 
9879 expressed sec 


romosome X (unique) 
uence 


NM_007183 


PKP3 


-0.91 


4.14 


0.48 


PKP3 


plakophilin 3 


Cell adhesion, 
Cytoskeleton, 
Structural protein, 
Nuclear protein, 
Repeat 


D13642 


SF3B3 


-0.65 


2.28 


0.48 


SF3B3 


splicing factor 3b, 
subunit3, 130kDa 


Hypothetical protein, 
Spliceosome, mRNA 
processing, mRNA 
splicing, Nuclear 
protein 


NM_003756 


EIF3S3 


-1.85 


2.19 


0.46 


EIF3S3 


eukaryotic 
translation initiation 
factor 3, subunit 3 
gamma, 40kDa 


Initiation factor, 
Protein biosynthesis 


Contig47096_RC 




-0.41 


4.52 


0.54 


PFKFB4 


6-phosphofructo-2- 
kinase/fructose- 
2,6-biphosphatase 
4 


Kinase, 
Multifunctional 
enzyme, Transferase, 
Hydrolase, ATP- 
binding, i 
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Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Sequence 
name 


Description 


Sp_xref_keyword_li 
st 
















Phosphorylation, 
Multigene family 


NM_004209 


SYNGR 
3 


-0.31 


3.67 


0.53 


SYNGR3 


synaptogyrin 3 


Transmembrane ; 


Contig3464_RC 




0.99 


-5.81 


-0.52 




ESTs 




Contig31646_RC 




1.1 


-7.76 


-0.5 


COL14A1 


collagen, type XIV, 
alpha 1 (undulin) 


Extracellular matrix, 
Glycoprotein, 
Hypothetical protein, 
Collagen, Signal 


Contig49388_RC 




1.73 


-1.75 


-0.51 


FLJ13322 


hypothetical protein 
FLJ 13322 


Hypothetical protein 


Contig41887_RC 




0.37 


-5.74 


-0.47 


LOC12422 
0 


similar to common 
salivary protein 1 


Hypothetical protein 



Table 8 Good and poor prognosis templates: mean subtracted log(intensity) values for 
each of the seventy markers for 44 breast cancer patients having a good 
prognosis (CI) or 34 breast cancer patients having a poor prognosis (C2) (see, 
5 PCT publication WO 2004/065545, published August 5, 2004, which is 

incorporated herein by reference in its entirety) 



Marker Accession 


CI 


C2 


SEQ ID NO 


# 


(good prognosis template) (poor prognosis template) 




AL080059 


-5.161569 


0.043019 


367 


Contig63649_RC 


-1.440895 


0.966702 


361 


Contig46218 RC 


-0.937662 


0.815081 


321 


NM 016359 


-1.49878 


0.872829 


253 


AA555029 RC 


-1.283504 


0.543442 


368 


NM 003748 


1.355486 


-0.254201 


369 


Contig38288 RC 


-1.237495 


1.085461 


297 


NM 003862 


0.981236 


-1.619658 


370 


Contig28552 RC 


-1.296043 


1.067545 


281 


Contig32125 RC 


0.855155 


-0.7338 


371 


U82987 


1.256206 


-1.362807 


372 


AL137718 


-0.55046 


0.68754 


373 


AB037863 


0.819061 


-1.621057 


374 


NM 020188 


-1.137582 


0.673123 


375 


NM 020974 


-0.463953 


-5.623268 


376 


NM 000127 


-0.618568 


0.552726 


377 


NM 002019 


-1.409168 


0.547285 


378 


NM 002073 


-1.577177 


0.417352 


379 


NM 000436 


-0.722574 


0.599239 


380 


NM 004994 


-5.561089 


-2.180659 


165 


Contig55377 RC 


0.805683 


-1.16728 


381 


Contig35251_RC 


-0.931146 


0.607562 


382 


Contig25991 


-0.720727 


1.045949 


383 


NM 003875 


-1.206839 


1.163244 


384 


NM 006101 


-0.879965 


0.628296 


385 


NM 003882 


0.529121 


-0.467098 


386 


NM 003607 


-0.959094 


0.709653 


387 
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Marker Accession 


CI 


C2 


SEQ ID NO 


# 


(good prognosis template) (poor prognosis template) 




AF073519 


-1.451486 


0.163988 


388 


AF052162 


-1.145575 


0.192391 


9 


NM 000849 


0.944742 


-1.499473 


389 


Contig32185 RC 


-0.887643 


0.688257 


390 


NM_016577 


-2.762008 


0.081637 


391 


Contig48328 RC 


0.405401 


-2.946904 


392 


Contig46223 RC 


0.805424 


-0.581849 


393 


NM 015984 


-1.056531 


0.522176 


394 


NM 006117 


1.129928 


-1.262974 


395 


AK000745 


-2.475715 


-0.013002 


396 


Contig40831 RC 


-1.17091 


0.435754 


397 


NM 003239 


0.457773 


-2.150499 


398 


NM 014791 


-1.14862 


0.383018 


399 


X05610 


-0.768514 


0.637938 


400 


NM 016448 


-0.713264 


0.632638 


401 


NM 018401 


0.618921 


-0.286778 


402 


NM_000788 


-0.995116 


0.50246 


403 


Contig51464 RC 


-0.663538 


0.765975 


404 


AL080079 


-1.794821 


0.43708 


405 


NM 006931 


-0.846271 


0.915602 


406 


AF257175 


1.122354 


-0.721924 


407 


NM 014321 


-1.820261 


0.482287 


241 


NM 002916 


-0.966852 


0.599925 


408 


Contig55725_RC 


-2.935162 


0.623397 


409 


Contig24252 RC 


-2.004671 


0.263597 


410 


AF201951 


0.355839 


-2.296556 


411 


NM 005915 


-0.586121 


0.827714 


412 


NM_001282 


0.762645 


-0.970418 


413 


Contig56457 RC 


-0.920808 


0.588269 


414 


NM 000599 


-3.612469 


-0.714313 


51 


NM 020386 


-0.46073 


0.699313 


415 


NM 014889 


-1.678462 


0.2362 


416 


AF055033 


-2.505271 


-0.07576 


11 


NM 006681 


-0.631302 


0.584119 


417 


NM_007203 


-1.426446 


0.504624 


418 


Contig63102 RC 


0.521511 


-1.266163 


419 


NM 003981 


-2.521877 


0.552669 


420 


Contig20217 RC 


-0.363574 


0.449022 


421 


NM 001809 


-2.171301 


0.328419 


81 


Contig2399 RC 


-1.174844 


0.602523 


422 


NM 004702 


-1.560133 


0.619078 


155 


NM 007036 


-0.950633 


0.34945 


423 


NM 018354 


-1.392354 


0.347831 


424 



5.2. DIAGNOSTIC AND PROGNOSTIC METHODS 
5.2.1 SAMPLE COLLECTION 
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In the present invention, markers, such as target polynucleotide molecules or proteins, 
are extracted from a sample taken from an individual afflicted with a condition such as breast 
cancer. The sample may be collected in any clinically acceptable manner, but must be 
collected such that marker-derived polynucleotides {i.e., RNA) are preserved (if gene 
expression is to be measured) or proteins are preserved (if encoded proteins are to be 
measured). For example, mRNA or nucleic acids derived therefrom {i.e., cDNA or amplified 
DNA) are preferably labeled distinguishably from standard or control polynucleotide 
molecules, and both are simultaneously or independently hybridized to a microarray 
comprising some or all of the markers or marker sets or subsets described above. 
0 Alternatively, mRNA or nucleic acids derived therefrom may be labeled with the same label 
as the standard or control polynucleotide molecules, wherein the intensity of hybridization of 
each at a particular probe is compared. A sample may comprise any clinically relevant tissue 
sample, such as a tumor biopsy or fine needle aspirate, or a sample of bodily fluid, such as 
blood, plasma, serum, lymph, ascitic fluid, cystic fluid, urine or nipple exudate. The sample 
5 may be taken from a human, or, in a veterinary context, from non-human animals such as 
ruminants, horses, swine or sheep, or from domestic companion animals such as felines and 
canines. 

Methods for preparing total and poly(A)+ RNA are well known and are described 
generally in Sambrook et al, Molecular Cloning - A Laboratory Manual (2nd Ed.), 
20 Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1989)) and 
Ausubel etal, Current Protocols in Molecular Biology, vol. 2, Current Protocols 
Publishing, New York (1 994)). 

RNA may be isolated from eukaryotic cells by procedures that involve lysis of the 
cells and denaturation of the proteins contained therein. Cells of interest include wild-type 
25 cells {i.e., non-cancerous), drug-exposed wild-type cells, tumor- or tumor-derived cells, 

modified cells, normal or tumor cell line cells, and drug-exposed modified cells. Preferably, 
the cells are breast cancer tumor cells. 

Additional steps may be employed to remove DNA. Cell lysis may be accomplished 
with a nonionic detergent, followed by microcentrifugation to remove the nuclei and hence 
30 the bulk of the cellular DNA. In one embodiment, RNA is extracted from cells of the various 
types of interest using guanidinium thiocyanate lysis followed by CsCl centrifugation to 
separate the RNA from DNA (Chirgwin et al, Biochemistry 18:5294-5299 (1979)). 
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Poly(A)+ RNA is selected by selection with oligo-dT cellulose {see Sambrook et al, 
Molecular Cloning - A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York (1989). Alternatively, separation of RNA from 
DNA can be accomplished by organic extraction, for example, with hot phenol or 
phenol/chloroform/isoamyl alcohol. 

If desired, RNase inhibitors may be added to the lysis buffer. Likewise, for certain 
cell types, it may be desirable to add a protein denaturation/digestion step to the protocol. 

For many applications, it is desirable to preferentially enrich mRNA with respect to 
other cellular RNAs, such as transfer RNA (tRNA) and ribosomal RNA (rRNA). Most 
mRNAs contain a poly(A) tail at their 3' end. This allows them to be enriched by affinity 
chromatography, for example, using oligo(dT) or poly(U) coupled to a solid support, such as 
cellulose or Sephadex™ (see Ausubel et al., Current Protocols in Molecular Biology, 
vol. 2, Current Protocols Publishing, New York (1994). Once bound, poly(A)+ mRNA is 
eluted from the affinity column using 2 mM EDTA/0.1% SDS. 

The sample of RNA can comprise a plurality of different mRNA molecules, each 
different mRNA molecule having a different nucleotide sequence. In a specific embodiment, 
the mRNA molecules in the RNA sample comprise at least 5, 10, 15, 20, 25, 30, 40 or 50 
different nucleotide sequences. More preferably, the mRNA molecules of the RNA sample 
comprise mRNA molecules corresponding to each of the marker genes. In another specific 
embodiment, the RNA sample is a mammalian RNA sample. 

In a specific embodiment, total RNA or mRNA from cells is used in the methods of 
the invention. The source of the RNA can be cells of a plant or animal, human, mammal, 
primate, non-human animal, dog, cat, mouse, rat, bird, yeast, eukaryote, prokaryote, etc. In 
specific embodiments, the method of the invention is used with a sample containing total 
mRNA or total RNA from 1 x 10 6 cells or less. In another embodiment, proteins can be 
isolated from the foregoing sources, by methods known in the art, for use in expression 
analysis at the protein level. 

Probes to the homologs of the marker sequences disclosed herein can be employed 
preferably when non-human nucleic acid is being assayed. 
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The prognosis may be carried out using a cellular constituent profile of any type of 
molecular markers. For example, sets of proteins informative for breast cancer prognosis 
may be used. Such protein markers may be measured, for example, by use of gel 
electrophoresis, such as one-dimensional polyacrylamide gel electrophoresis, two- 

5 dimensional polyacrylamide gel electrophoresis, nondenaturing polyacrylamide gel 

electrophoresis; isoelectric focusing gels, etc., by use of antibody arrays, etc. Of course, the 
particular template(s) used to classify the individual depends upon the type(s) of cellular 
constituents used as markers. For example, where nucleic acids (e.g., genes or nucleic acids 
derived from expressed genes) are used as markers, the template comprises nucleic acids (or 

1 0 the level of expression or abundance thereof); where proteins are used as markers, the 
template comprises proteins, for example, the level or abundance of those proteins; etc. 

5.2.2. USE OF PROGNOSTIC GENESETS FOR BR EAST CANCER 
According to the present invention, once genesets informative for a plurality of 
subsets of a condition are identified, an individual is classified into one of these subsets and a 
1 5 prognosis is made based on the expression of the genes, or their encoded proteins, in the 
geneset for that subset in a breast cancer tumor sample taken from the individual. The 
patient's responsiveness to chemotherapy is then determined based on the patient's subset 
classification and the prognosis. Various methods are known in the art that can be used for 
classifying an expression profile. Exemplary methods are described in Section 5.2.4, infra. 

20 For example, a particular hypothetical condition has three relevant phenotypic 

characteristics, A, B and C. In this example, based on these characteristics, genesets 
informative for prognosis of four patient subsets A*B + ; A + B"C + ; A + B~C~; and A - are 
identified by the method described above. Thus, an individual having the condition would 
first be classified according to phenotypes A-C into one of the four patient subsets. In one 
25 embodiment, therefore, the invention provides for the classification of an individual having a 
condition into one of a plurality of patient subsets, wherein a set of genes informative for 
prognosis for the subset has been identified. A sample is then taken from the individual, and 
the levels of expression of the prognostically-informative genes in the sample is analyzed. In 
one embodiment, the expression level of each gene can be compared to the expression level 
30 of the corresponding gene in a control or reference sample to determine a differential 
expression level. The expression profile comprising expression levels, e.g., differential 
expression levels, of the plurality of genes is then compared to a template profile. In various 
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embodiments, the template profile is a good prognosis template comprising the average 
expression of informative genes in samples taken from good prognosis individuals 
classifiable into that patient subset; or a poor prognosis template comprising the average 
expression of informative genes in samples taken from poor prognosis individuals classifiable 
5 into that patient subset; or a good prognosis profile comprising a set of mathematical values 
that represent gene expression levels of good prognosis individuals classifiable into that 
patient subset; etc. 

In a specific embodiment, the phenotypic, genotypic and/or clinical classes for breast 
cancer are: ER - , BRCA1 individuals; ER~~, sporadic individuals; ER+, ER/AGE high 

10 individuals; ER+, ER/AGE low, LN+ individuals; and ER+, ER/AGE low, LN - individuals. 
In this embodiment, an individual may be classified as ER+ or ER~. If the individual is ER~ 
the individual is additionally classified as having a BRCA1 -type or sporadic tumor. ER~ 
individuals are thus classified as ER - BRCA1 or ER - sporadic. Alternatively, if the 
individual is classified as ER+, the individual is additionally classified as having a high or 

1 5 low ratio of the log (ratio) of the level of expression of the gene encoding the estrogen 

receptor to the individual's age. Individuals having a low ratio are additionally classified as 
LN+ or LN-. ER+ individuals are thus classified as ER+, ER/AGE high; ER+, ER/AGE low, 
LN+, or ER+, ER/AGE low, LN - Of course, the individual's ER status, tumor type, age and 
LN status may be identified in any order, as long as the individual is classified into one of 

20 these five subsets. 

In one embodiment, a breast cancer patient is first classified into one of the following 
patient subsets: ER - , BRCA1 individuals; ER~, sporadic individuals; ER+, ER/AGE high 
individuals; ER+, ER/AGE low, LN+ individuals; and ER+, ER/AGE low, LN - individuals. 
The patient is then classified as having a good prognosis or a poor prognosis by comparing 

25 the patient's expression profile of a plurality of genes or their encoded proteins to a good 
and/or poor prognosis template profile of expression levels of said plurality of genes or then- 
encoded proteins for the appropriate subset, and classifying said patient as having a good 
prognosis or poor prognosis if said patient expression profile has a high similarity to a good 
prognosis template or a poor prognosis template, respectively. In a preferred embodiment, 

30 the good prognosis template for a patient subset comprises the average level of expression of 
each of said plurality of genes informative for prognosis in tumor samples from individuals 
classified into said subset who have a good prognosis or good outcome, while the poor 
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prognosis template comprises the average level of expression of each of said plurality of 
genes informative for prognosis in tumor samples from individuals classified into said subset 
who have a poor prognosis or poor outcome. In another specific embodiment, said good or 
poor prognosis template is a set of mathematical values representing the average level of 
5 expression of genes informative for prognosis in tumor samples of individuals classifiable 
into said subset who have a good prognosis, or who have a poor prognosis, respectively. 

It is evident that the different patient subsets described herein reflect different 
molecular mechanisms of the initiation of tumor formation and metastasis. Thus, the 
genesets listed in tables 1-5 are also useful for diagnosing a person as having a particular type 

10 of breast cancer in the first instance. Thus, the invention also provides a method of 
diagnosing an individual as having a particular subtype of breast cancer, comprising 
determining an expression profile of a plurality of the genes for which markers are listed in 
Tables 1-5 in a sample from said individual; and comparing said expression profile to a 
template profile, where said template is representative of the expression of said plurality of 

1 5 genes in a breast cancer sample of said subtype of cancer, and on the basis of said 

comparison, diagnosing the individual as having said subtype of breast cancer. In a specific 
embodiment, said subtype of cancer is selected from the group consisting of ER~, BRCA1 
type; ER~ sporadic type; ER+, ER/AGE high type; ER+, ER/AGE low, LN+ type; and 
ER/AGE low, LN~ type. In another specific embodiment, said template comprises the 

20 average levels of expression of a plurality of the genes for which markers are listed in Table 
1, Table 2, Table 3, Table 4 or Table 5. In another specific example, said comparing 
comprises determining the similarity of the expression profile of the genes for which markers 
are listed in each of Tables 1-5 in said sample taken from said individual to a template profile 
comprising levels of expression of the same genes for each of Tables 1-5, and determining 

25 whether the pattern of expression of said genes in said sample is most similar to the pattern of 
expression of the genes for which markers are listed in Table 1, Table 2, Table 3, Table 4 or 
Table 5. 

In another embodiment, a breast cancer patient is classified as having a good 
prognosis or a poor prognosis by a method comprising: (a) classifying said patient as ER~, 
30 BRCA1; ER~ sporadic; ER+, ER/AGE high; ER+, ER/AGE low, LN+; or ER+, ER/AGE 
low, LN~; (b) determining an expression profile of a first plurality of genes in a cell sample 
taken from the patient, said first plurality of genes comprising at least two of the genes 
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corresponding to the markers Table 1 if said patient is classified as ER~ BRCA1; Table 2 if 
said patient is classified as ER - , sporadic; Table 3 if said patient is classified as ER+, 
ER/AGE high; Table 4 if said patient is classified as ER+, ER/AGE low, LN+; or Table 5 if 
said patient is classified as ER+, ER/AGE low, LN~, wherein said patient is "ER/AGE high" 

5 if the ER level of the patient exceeds a predetermined value, and "ER/AGE low" if the ER 
level of the patient does not exceed said predetermined value. In a specific embodiment of 
this method, said predetermined value of ER is calculated as ER = 0.1 (AGE - 42.5), wherein 
AGE is the age of said patient. In another specific embodiment, said patient is ER - BRCA1, 
and said plurality of genes comprises at least 1, 2, 3, 4, 5, 10 or all of the genes for which 

'0 markers are listed in Table 1 . In another specific embodiment, said patient is ER - , sporadic, 
and said plurality of genes comprises at least 1, 2, 3, 4, 5, 10 or all of the genes for which 
markers are listed in Table 2. In another specific embodiment, said patient is ER+, ER/AGE 
high, and said plurality of genes comprises at least 1, 2, 3, 4, 5, 10 or all of the genes for 
which markers are listed in Table 3. In another specific embodiment, said patient is ER+, 

15 ER/AGE low, LN+, and said plurality of genes comprises at least 1, 2, 3, 4, 5, 10 or all of the 
genes for which markers are listed in Table 4. In another specific embodiment, said patient is 
ER+, ER/AGE low, LN"~, and said plurality of genes comprises at least 1, 2, 3, 4, 5, 10 or all 
of the genes for which markers are listed in Table 5. In another specific embodiment, the 
method additionally comprises determining in said cell sample the level of expression, 

>0 relative to a control, of a second plurality of genes for which markers are not found in Tables 
1-5, wherein said second plurality of genes is informative for prognosis. 

Where information is available regarding the LN status of a breast cancer patient, the 
patient may be identified as having a "very good prognosis," an "intermediate prognosis," or 
a poor prognosis, which enables the refinement of treatment. In one embodiment, the 

25 invention provides a method of assigning a therapeutic regimen to a breast cancer patient, 
comprising: (a) classifying said patient as having a "poor prognosis," "intermediate 
prognosis," or "very good prognosis" on the basis of the levels of expression of at least five 
genes for which markers are listed in Table 1, Table 2, Table 3, Table 4 or Table 5; and (b) 
assigning said patient a therapeutic regimen, said therapeutic regimen (i) comprising no 

30 adjuvant chemotherapy if the patient is lymph node negative and is classified as having a 
good prognosis or an intermediate prognosis, or (ii) comprising chemotherapy if said patient 
has any other combination of lymph node status and expression profile. 
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In another embodiment, a breast cancer patient is assigned a prognosis by a method 
comprising (a) determining the breast cancer patient's age, ER status, LN status and tumor 
type; (b) classifying said patient as ER~, sporadic; ER", BRCA1; ER+, ER/AGE high; ER+, 
ER/AGE low, LN+; or ER+, ER/AGE low, LIST; (c) determining an expression profile 
5 comprising at least five genes in a cell sample taken from said breast cancer patient wherein 
markers for said at least five genes are listed in Table 1 if said patient is classified as ER - , 
sporadic; Table 2 if said patient is classified as ER - , BRCA1; Table 3 if said patient is 
classified as ER+, ER/AGE high; Table 4 if said patient is classified as ER+, ER/AGE low, 
LN+; or Table 5 if said patient is classified as ER+, ER/AGE high, LN - ; (d) determining the 
10 similarity of the expression profile of said at least five genes to a template profile comprising 
levels of expression of said at least five genes to obtain a patient similarity value; (e) 
comparing said patient similarity value to selected first and second threshold values of 
similarity, respectively, wherein said second similarity threshold indicates greater similarity 
to said template expression profile than does said first similarity threshold; and (f) classifying 
1 5 said breast cancer patient as having a first prognosis if said patient similarity value exceeds 
said second threshold similarity values, a second prognosis if said patient similarity value 
exceeds said first threshold similarity value but does not exceed said second threshold 
similarity value, and a third prognosis if said patient similarity value does not exceed said 
first threshold similarity value. In a specific embodiment of the method, said first prognosis 
20 is a "very good prognosis," said second prognosis is an "intermediate prognosis," and said 
third prognosis is a "poor prognosis," wherein said breast cancer patient is assigned a 
therapeutic regimen comprising no adjuvant chemotherapy if the patient is lymph node 
negative and is classified as having a good prognosis or an intermediate prognosis, or 
comprising chemotherapy if said patient has any other combination of lymph node status and 
25 expression profile. 

The invention also provides a method of assigning a therapeutic regimen to a breast 
cancer patient, comprising: (a) determining the lymph node status for said patient; (b) 
determining the expression of at least five genes for which markers are listed in Table 5 in a 
cell sample from said patient, thereby generating an expression profile; (c) classifying said 
30 patient as having a "poor prognosis," "intermediate prognosis," or "very good prognosis" on 
the basis of said expression profile; and (d) assigning said patient a therapeutic regimen, said 
therapeutic regimen comprising no adjuvant chemotherapy if the patient is lymph node 
negative and is classified as having a good prognosis or an intermediate prognosis, or 
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comprising chemotherapy if said patient has any other combination of lymph node status and 
classification. In a specific embodiment of this method, said therapeutic regimen assigned to 
lymph node negative patients classified as having an "intermediate prognosis" additionally 
comprises adjuvant hormonal therapy. In another specific embodiment of this method, said 

5 classifying step (c) is carried out by a method comprising: (a) rank ordering in descending 
order a plurality of breast cancer tumor samples that compose a pool of breast cancer tumor 
samples by the degree of similarity between the expression profile of said at least five genes 
in each of said tumor samples and the expression profile of said at least five genes across all 
remaining tumor samples that compose said pool, said degree of similarity being expressed as 

[ 0 a similarity value; (b) determining an acceptable number of false negatives in said classifying 
step, wherein a false negative is a breast cancer patient for whom the expression levels of said 
at least five genes in said cell sample predicts that said breast cancer patient will have no 
distant metastases within the first five years after initial diagnosis, but who has had a distant 
metastasis within the first five years after initial diagnosis; (c) determining a similarity value 

1 5 above which in said rank ordered list said acceptable number of tumor samples or fewer are 
false negatives; (d) selecting said similarity value determined in step (c) as a first threshold 
similarity value; (e) selecting a second similarity value, greater than said first similarity 
value, as a second threshold similarity value; and (f) determining the similarity between the 
expression profile of said at least five genes in a breast cancer tumor sample from the breast 

10 cancer patient and the expression profile of said respective at least five genes in said pool, to 
obtain a patient similarity value, wherein if said patient similarity value equals or exceeds 
said second threshold similarity value, said patient is classified as having a "very good 
prognosis"; if said patient similarity value equals or exceeds said first threshold similarity 
value, but is less than said second threshold similarity value, said patient is classified as 

>5 having an "intermediate prognosis"; and if said patient similarity value is less than said first 
threshold similarity value, said patient is classified as having a "poor prognosis." Another 
specific embodiment of this method comprises determining the estrogen receptor (ER) status 
of said patient, wherein if said patient is ER positive and lymph node negative, said 
therapeutic regimen assigned to said patient additionally comprises adjuvant hormonal 

30 therapy. 

The invention provides a method of predicting the responsiveness of a breast cancer 
patient to chemotherapy based on the patient's cellular constituent profile comprising 
measurements of a plurality of cellular constituents in a sample derived from the patient, the 



-48- 




WO 2006/084272 PCT/LS2006/004280 

patient's estrogen receptor level ("ER level"), and the patient's estrogen receptor level 
relative to the patient's age ("ER/AGE"). In the method of the invention, a patient is 
classified into an appropriate chemotherapy responsiveness group as compared to patients in 
the general population of breast cancer patients. The inventors have discovered that a patient 

5 exhibits enhanced response to chemotherapy, e.g., to treatment by a combination of 

cyclophosphamide, methotrexate, and 5-fluorouracil (the "CMF combination"), as compared 
to patients in the general population of breast cancer patients if the patient's cellular 
constituent profile indicates poor prognosis, and the patient's ER level is ER , and the 
patient's estrogen receptor level relative to the patient's age ER/AGE is low, whereas the 

10 patient exhibits reduced response to chemotherapy as compared to patients in the general 
population if (i) the patient's ER level is ER", or (ii) the patient's cellular constituent profile 
indicates poor prognosis, and the patient's ER level is ER + , and the patient's estrogen 
receptor level relative to the patient's age ER/AGE is high. Here, ER + designates a high ER 
level and ER" designates a low ER level. In one embodiment, the ER levels are measured by 

1 5 immunohistochemical staining with tissue samples, and those patients whose tissue samples 
show greater than 10% of nuclei showing staining is deemed ER + and less than 10% of nuclei 
showing staining is deemed ER". In another embodiment, ER+ and ER- patients are 
separated by the transcript or mRNA level of a gene encoding the estrogen receptor measured 
using a microarray, with ER+ patients having logl0(ratio) > -0.65, and ER- patients having 

20 logl0(ratio) = or <-0.65, where the ratio is the ratio of the estrogen receptor mRNA level of 
the patient sample and a control sample, e.g., a pool of cellular constituents from a plurality 
of different breast tumor samples. Preferably, the plurality of samples comprises at least 50, 
100, 200, or 300 different samples. In addition, the inventors have also found that if the 
patient's cellular constituent profile indicates a good prognosis, then the patient does not need 

25 chemotherapy. The method of the invention is particularly useful for predicting the 

responsiveness of a breast cancer patient under the age of 55 to chemotherapy, e.g., treatment 
by the CMF combination. 

In one embodiment, the patient's cellular constituent profile is evaluated to determine 
whether the profile indicates good prognosis or poor prognosis. In a preferred embodiment, 
30 the patient's prognosis is evaluated by comparing the cellular constituent profile to a 

predetermined cellular constituent template profile corresponding to a certain prognosis level, 
e.g., a good prognosis template comprising measurements of the plurality of cellular 
constituents which are representative of levels of the cellular constituents in a plurality of 
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good outcome patients or a poor prognosis template comprising measurements of the 
plurality of cellular constituents which are representative of levels of the cellular constituents 
in a plurality of poor outcome patients. In a preferred embodiment, the good prognosis 
. template comprises average levels of the respective cellular constituents in a plurality of good 

5 outcome patients. In a preferred embodiment, the poor prognosis template comprises average 
levels of the respective cellular constituents in a plurality of poor outcome patients. In one 
embodiment, the average level of each cellular constituent in the good prognosis or poor 
prognosis template is a simple average. In another embodiment, the average level of each 
cellular constituent in the good prognosis or poor prognosis template is an error-weighted 

10 average. Herein a good outcome patient is a patient who has non-occurrence of metastases 
within a period of time after initial diagnosis, e.g., a period of 1 , 2, 3, 4, 5 or 10 years, and a 
poor outcome patient is a patient who has occurrence of metastases within a period of time 
after initial diagnosis, e.g., a period of 1, 2, 3, 4, 5 or 10 years. In a preferred embodiment, 
both periods are 10 years. Table 8 shows exemplary good (column Ci) and poor (column C2) 

1 5 template profiles for the 70-gene marker set. 

The degree of similarity of the patient's cellular constituent profile to a template 
representing good or poor prognosis can be used to indicate whether the patient has good or 
poor prognosis. In a preferred embodiment, a patient is classified as having a good prognosis 
profile if the patient's cellular constituent profile has a high similarity to a good prognosis 

20 template and/or has a low similarity to a poor prognosis template. In another embodiment, a 
patient is classified as having a poor prognosis profile if the patient's cellular constituent 
profile has a low similarity to a good prognosis template and/or has a high similarity to a poor 
prognosis template. In embodiments for predicting the responsiveness of a breast cancer 
patient under the age of 55, the patients in the good and poor outcome patient populations 

25 used to generate the templates are preferably also under the age of 55 at the time of diagnosis 
of breast cancer. 

The degree of similarity between a patient's cellular constituent profile and a template 
profile can be determined using any method known in the art. In one embodiment, the 
similarity is represented by a correlation coefficient between the patient's profile and the 
30 template. In one embodiment, a correlation coefficient above a correlation threshold 

indicates high similarity, whereas a correlation coefficient below the threshold indicates low 
similarity. In preferred embodiments, the correlation threshold is set as 0.3, 0.4, 0.5 or 0.6. 
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In another embodiment, similarity between a patient's profile and a template is represented 
by a distance between the patient's profile and the template. In one embodiment, a distance 
below a given value indicates high similarity, whereas a distance equal to or greater than the 
given value indicates low similarity. 

5 As an illustration, in one embodiment, a template for a good prognosis is defined as 

c, (e.g., a profile consisting of the values listed in the good prognosis d column of Table 8) 
and/or a template for poor prognosis is defined as c 2 (e.g., a profile consisting of the values 
listed in the good prognosis C 2 column of Table 8). Either one or both of the two classifier 
parameters ( ?, and P 2 ) can then be used to measure degrees of similarities between a 
10 patient's profile and the templates: P v measures the similarity between the patient's profile y 
and the good prognosis template c, , and P 2 measures the similarity between y and the poor 
prognosis template c 2 . In embodiments which employ correlation coefficients, the 
correlation coefficients P, can be calculated as: 



15 where /= 1 and 2. 

Thus, in one embodiment, y is classified as a good prognosis profile if P x is greater than a 
selected correlation threshold or if P 2 is equal to or less than a selected correlation threshold. 
In another embodiment, y is classified as a poor prognosis profile if P L is less than a selected 
correlation threshold or if P 2 is above a selected correlation threshold. In still another 
20 embodiment, y is classified as a good prognosis profile if /», is greater than a first selected 
correlation threshold and y is classified as a poor prognosis profile if P 2 is greater than a 
second selected correlation threshold. 

Other methods known in the art, e.g., the methods described in Section 5.2.4, infra, 
can also be used for classifying an expression profile according to prognosis. 

25 In a preferred embodiment, the cellular constituent profile is an expression profile 

comprising measurements of a plurality of transcripts (measured e.g., by measuring mRNAs 
or cDNA) in a sample derived from a patient. In this embodiment, the good prognosis 
template can be a good prognosis expression template comprising measurements of the 
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plurality of transcripts which are representative of expression levels of the transcripts in a 
plurality of good outcome patients, and the poor prognosis template can be a poor prognosis 
expression template comprising measurements of the plurality of transcripts which are 
representative of expression levels of the transcripts in a plurality of poor outcome patients. 

5 In a preferred embodiment, measurement of each transcript in the good or poor prognosis 
expression template is an average of expression levels of the transcript in the plurality of 
good or poor outcome patients, respectively. In one embodiment, each measurement is a 
mean subtracted log(intensity) (see, PCT publication WO 2004/065545, published August 5, 
2004, which is incorporated herein by reference in its entirety). In one embodiment, the 

1 0 plurality of transcripts consists of transcripts corresponding to at least a portion of the set of 
genes listed in Table 8. Preferably, the plurality of transcripts consists of transcripts 
corresponding to at least 50%, 60%, 70%, 80%, 85%, 90%, 95% or 98% of the set of genes 
listed in Table 8, or at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, or 65 of the genes 
listed in Table 8. In a preferred embodiment, the plurality of transcripts consists of 

1 5 transcripts corresponding to all the genes listed in Table 8. 

In another embodiment, the expression profile is a differential expression profile 
comprising differential measurements of said plurality of transcripts in a sample derived from 
said patient versus measurements of said plurality of transcripts in a control sample. The 
differential measurements can be xdev, log(ratio), error-weighted log(ratio), or a mean 
20 subtracted log(intensity) (see, e.g., PCT publication WO 00/39339, published on July 6, 
2000; PCT publication WO 2004/065545, published August 5, 2004, each of which is 
incorporated herein by reference in its entirety). 

In another embodiment, the cellular constituent profile comprises measurements of a 
plurality of protein species in a sample derived from the patient. In this embodiment, a good 
25 prognosis template comprises measurements of the plurality of protein species which are 

representative of levels of the protein species in a plurality of good outcome patients, and the 
poor prognosis template comprises measurements of the plurality of protein species which are 
representative of levels of the protein species in the plurality of poor outcome patients. 

The patient's cellular constituent profile can be determined by any method known in 
30 the art, e.g., as described in Section 5.3. 
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The methods of the invention preferably use a control or reference sample, which can 
be any suitable sample against which changes in cellular constituents can be determined. In 
one embodiment, the control or reference sample is generated by pooling together the 
plurality of cellular constituents, e.g., a plurality of transcripts or cDNAs, or a plurality of 
5 protein species, from a plurality of breast cancer patients. Alternatively, the control or 
reference sample can be generated by pooling together purified or synthesized cellular 
constituents, e.g., a plurality of purified or synthesized transcripts or cDNAs, a plurality of 
purified or synthesized protein species. In one embodiment, synthetic RNAs for each 
transcripts or cDNAs are pooled to form the control or reference sample. Preferably, the 

1 0 abundances of synthetic RNAs are approximately the abundances of the corresponding 
transcripts in a real tumor pool. The differential expression of marker genes for each 
individual patient sample is measured against this control sample. In one embodiment, 60- 
mer oligonucleotides corresponding to the probe sequences on a microarray used to assay the 
expression levels of the diagnostic/prognostic transcripts are synthesized and cloned into 

1 5 pBluescript SK- vector (Statagene, La Mia, CA), adjacent to the T7 promotor sequence. 
Individual clones are isolated, and the sequences of their inserts are verified by DNA 
sequencing. To generate synthetic RNAs, clones are linearized with £coRI and a T7 in vitro 
transcription (TVT) reaction is performed by MegaScript kit (Ambion, Austin, TX), followed 
by DNase treatment of the product. Synthetic RNAs are purified on RNeasy columns 

20 (Qiagen, Valencia, CA). These synthetic RNAs are transcribed, amplified, labeled, and 
mixed together to make the reference pool. The abundance of those synthetic RNAs are 
chosen to approximate the abundances of the transcripts of the corresponding marker genes in 
the real tumor pool. 

In one embodiment, the ER level is determined by measuring an expression level of a 
25 gene encoding the estrogen receptor in the patient relative to that in a control sample. In a 
preferred embodiment, the gene encoding the estrogen receptor is the estrogen receptor a 
gene. In another embodiment, the ER level is measured by an oligonucleotide probe that 
detects the transcript of the gene having accession number NM_000125 (SEQ ID NO: 425). 
The control sample is a pool of breast cancer cells from different patients. In this 
30 embodiment, the ER level is classified as ER + if the log(ratio) > -0.65, and is classified as 
ER otherwise (see, e.g., van't Veer et al., 2002, Nature 415:530). In another embodiment, 
the ER level is determined by immunohistochemical staining measurement of the level of the 
estrogen receptors in a tissue sample of the patient. In one embodiment, the ER level is 
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classified as ER + if immunohistochemical staining shows greater than 10% of nuclei showing 
staining and the ER level is classified as ER", if less than 10% nuclei of showing staining. 

The patient's estrogen receptor level relative to the patient's age can be measured 
using an appropriate metric designated as ER/AGE. If the individual's ER level is high 

5 relative to the individual's age, e.g., if the expression level of an estrogen receptor gene of the 
individual is higher than an age-dependent threshold value, the individual is classified as 
ER/AGE high. In one embodiment, the log(ratio) of ER expression of an individual of a 
particular age is compared with the threshold value for the individual's age, and an ER 
expression level equal to or greater than the threshold value is deemed high. In one 

1 0 embodiment, the patient's estrogen receptor level as measured by logl0(ratio) is compared to 
a parameter defined as c-(AGE - d), where c is a coefficient, AGE is the age of said patient, 
and d is an age threshold. The patient is classified as having high ER/AGE if the patient's 
ER level is greater than c (AGE - d), and the patient is classified as having low ER/AGE if 
the patient's ER level is equal to or less than c(AGE - d). The parameters c and d depend on 

15 the ER level and AGE used. They can be determined by fitting patients' ER level-age 

distribution to a bimodal distribution of two subgroups each having a different ER level-age 
dependence. In a preferred embodiment, c = 0. 1 and d = 42.5. 

Chemotherapy can be performed using any one or a combination of the anti-cancer 
drugs known in the art, including but are not limited to any topoisomerase inhibitor, DNA 
20 binding agent, anti-metabolite, ionizing radiation, or a combination of two or more of such 
known DNA damaging agents. 

A topoisomerase inhibitor that can be used in conjunction with the invention can be a 
topoisomerase I (Topo I) inhibitor, a topoisomerase II (Topo II) inhibitor, or a dual 
topoisomerase I and II inhibitor. A topo I inhibitor can be from any of the following classes 

25 of compounds: camptothecin analogue (e.g., karenitecin, aminocamptothecin, lurtotecan, 

topotecan, irinotecan, BAY 56-3722, rubitecan, GI14721, exatecan mesylate), rebeccamycin 
analogue, PNU 166148, rebeccamycin, TAS-103, camptothecin (e.g., camptothecin 
polyglutamate, camptothecin sodium), intoplicine, ecteinascidin 743, J-107088, pibenzimol. 
Examples of preferred topo I inhibitors include but are not limited to camptothecin, topotecan 

30 (hycaptamine), irinotecan (irinotecan hydrochloride), belotecan, or an analogue or derivative 
thereof. 
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A topo II inhibitor that can be used in conjunction with the invention can be from any 
of the following classes of compounds: anthracycline antibiotics (e.g., carubicin, pirarubicin, 
daunorubicin citrate liposomal, daunomycin, 4-iodo-4-doxydoxorubicin, doxorubicin, n,n- 
dibenzyl daunomycin, morpholinodoxorubicin, aclacinomycin antibiotics, duborimycin, 

5 menogaril, nogalamycin, zorubicin, epirubicin, marcellomycin, detorubicin, annamycin, 7- 
cyanoquinocarcinol, deoxydoxorubicin, idarubicin, GPX-100, MEN- 1075 5, valrubicin, 
KRN5500), epipodophyllotoxin compound (e.g., podophyllin, teniposide, etoposide, GL331, 
2-ethylhydrazide), anthraquinone compound (e.g., ametantrone, bisantrene, mitoxantrone, 
anthraquinone), ciprofloxacin, acridine carboxamide, amonafide, anthrapyrazole antibiotics 

1 0 (e.g., teloxantrone, sedoxantrone trihydrochloride, piroxantrone, anthrapyrazole, 
losoxantrone), TAS-103, fostriecin, razoxane, XK469R, XK469, chloroquinoxaline 
sulfonamide, merbarone, intoplicine, elsamitrucin, CI-921, pyrazoloacridine, elliptinium, 
amsacrine. Examples of preferred topo II inhibitors include but are not limited to 
doxorubicin (Adriamycin), etoposide phosphate (etopofos), teniposide, sobuzoxane, or an 

1 5 analogue or derivative thereof. 

DNA binding agents that can be used in conjunction with the invention include but 
are not limited to DNA groove binding agent, e.g., DNA minor groove binding agent; DNA 
crosslinking agent; intercalating agent; and DNA adduct forming agent. A DNA minor 
groove binding agent can be an anthracycline antibiotic, mitomycin antibiotic (e.g., 
20 porfiromycin, KW-2149, mitomycin B, mitomycin A, mitomycin C), chromomycin A3, 
carzelesin, actinomycin antibiotic (e.g., cactinomycin, dactinomycin, actinomycin Fl), 
brostallicin, echinomycin, bizelesin, duocarmycin antibiotic (e.g., KW 2189), adozelesin, 
olivomycin antibiotic, plicamycin, zinostatin, distamycin, MS-247, ecteinascidin 743, 
amsacrine, anthramycin, and pibenzimol, or an analogue or derivative thereof. 

25 DNA crosslinking agents include but are not limited to antineoplastic alkylating 

agent, methoxsalen, mitomycin antibiotic, psoralen. An antineoplastic alkylating agent can 
be a nitrosourea compound (e.g., cystemustine, tauromustine, semustine, PCNU, 
streptozocin, SarCNU, CGP-6809, carmustine, fotemustine, methylnitrosourea, nimustine, 
ranimustine, ethylnitrosourea, lomustine, chlorozotocin), mustard agent (e.g., nitrogen 

30 mustard compound, such as spiromustine, trofosfamide, chlorambucil, estramustine, 2,2,2- 
trichlorotriethylarnine, prednimustine, novembichin, phenamet, glufosfamide, peptichemio, 
ifosfamide, defosfamide, nitrogen mustard, phenesterin, mannomustine, cyclophosphamide, 
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melphalan, perfosfamide, mechlorethamine oxide hydrochloride, uracil mustard, bestrabucil, 
DHEA mustard, tallimustine, mafosfamide, aniline mustard, chlornaphazine; sulfur mustard 
compound, such as bischloroethylsulfide; mustard prodrug, such as TLK286 and ZD2767), 
emylenimine compound (e.g., mitomycin antibiotic, ethylenimine, uredepa, thiotepa, 
5 diaziquone, hexamethylene bisacetamide, pentamethylmelamine, altretamine, carzinophilin, 
triaziquone, meturedepa, benzodepa, carboquone), alkylsulfonate compound (e.g., 
dimethylbusulfan, Yoshi-864, improsulfan, piposulfan, treosulfan, busulfan, hepsulfam), 
epoxide compound (e.g., anaxirone, mitolactol, dianhydrogalactitol, teroxirone), 
miscellaneous alkylating agent (e.g., ipomeanol, carzelesin, methylene dimethane sulfonate, 
1 0 mitobronitol, bizelesin, adozelesin, piperazinedione, VNP40 1 0 1 M, asaley, 6- 

hydroxymethylacylfulvene, E09, etoglucid, ecteinascidin 743, pipobroman), platinum 
compound (e.g., ZD0473, liposomal-cisplatin analogue, satraplatin, BBR 3464, spiroplatin, 
onnaplatin, cisplatin, oxaliplatin, carboplatin, lobaplatin, zeniplatin, iproplatin), triazene 
compound (e.g., imidazole mustard, CB 10-277, mitozolomide, temozolomide, procarbazine, 
1 5 dacarbazine), picoline compound (e.g., penclomedine), or an analogue or derivative thereof. 
Examples of preferred alkylating agents include but are not limited to cisplatin, 
dibromodulcitol, fotemustine, ifosfamide (ifosfamid), ranimustine (ranomustine), nedaplatin 
(latoplatin), bendamustine (bendamustine hydrochloride), eptaplatin, temozolomide 
(methazolastone), carboplatin, altretamine (hexamethylmelamine), prednimustine, oxaliplatin 
20 (oxalaplatinum), carmustine, thiotepa, leusulfon (busulfan), lobaplatin, cyclophosphamide, 
bisulfan, melphalan, and chlorambucil, or analogues or derivatives thereof. 

Intercalating agents can be an anthraquinone compound, bleomycin antibiotic, 
rebeccamycin analogue, acridine, acridine carboxamide, amonafide, rebeccamycin, 
anthrapyrazole antibiotic, echinomycin, psoralen, LU 79553, BW A773U, crisnatol mesylate, 
25 benzo(a)pyrene-7,8-diol-9,l 0-epoxide, acodazole, elliptinium, pixantrone, or an analogue or 
derivative thereof. 

DNA adduct forming agents include but are not limited to enediyne antitumor 
antibiotic (e.g., dynemicin A, esperamicin Al, zinostatin, dynemicin, calicheamicin gamma 
II), platinum compound, carmustine, tamoxifen (e.g., 4-hydroxy-tamoxifen), psoralen, 
30 pyrazine diazohydroxide, benzo(a)pyrene-7,8-diol-9, 1 0-epoxide, or an analogue or derivative 
thereof. 
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Anti-metabolites include but are not limited to cytosine, arabinoside, floxuridine, 
fluorouracil, mercaptopurine, Gemcitabine, and methotrexate (MTX). 

In a specific embodiment, a KSP inhibitor (lS)-l-{[(2S)-4-(2,5-difluorophenyl)-2- 
phenyl-2,5-dihydro-lH-pyrrol-l-yl]carbonyl}-2-methylpropylamine (see, PCT application 
5 PCTVUS03/1 8482, filed June 12, 2003, which is incorporated herein by reference in its 
entirety), is used alone or in combination with other anti-cancer drugs for chemotherapy. 

In another specific embodiment, the chemotherapy is carried out using the CMF 
combination consisting of cyclophosphamide, methotrexate, and 5-fluorouracil. 

5.2.3. IMPROVING SENSITIVITY TO EXPRESSION LEVEL DIFFERENCES 
10 In using the markers disclosed herein, and, indeed, using any sets of markers, e.g., to 

compare profiles or to differentiate an individual having one phenotype from another 
individual having a second phenotype, one can compare the profile comprising absolute 
expression levels of the markers in a sample to a template; for example, a template 
comprising the average levels of expression of the markers in a plurality of individuals. To 
1 5 increase the sensitivity of the comparison, however, the expression level values are preferably 
transformed in a number of ways. Also, to differentiate an individual having one phenotype 
from another individual having a second phenotype using any sets of markers, one can 
compare the absolute expression of each of the markers in a sample to a control; for example, 
the control can be the average level of expression of each of the markers, respectively, in a 
20 pool of individuals. 

For example, the expression level of each of the markers can be normalized by the 
average expression level of all markers the expression level of which is determined, or by the 
average expression level of a set of control genes. Thus, in one embodiment, the markers are 
represented by probes on a microarray, and the expression level of each of the markers is 

25 normalized by the mean or median expression level across all of the genes represented on the 
microarray, including any non-marker genes. In a specific embodiment, the normalization is 
carried out by dividing the median or mean level of expression of all of the genes on the 
microarray. In another embodiment, the expression levels of the markers is normalized by 
the mean or median level of expression of a set of control markers. In a specific 

30 embodiment, the control markers comprise a set of housekeeping genes. In another specific 
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embodiment, the normalization is accomplished by dividing by the median or mean 
expression level of the control genes. 

The sensitivity of a marker-based assay will also be increased if the expression levels 
of individual markers are compared to the expression of the same markers in a pool of 

5 samples. Preferably, the comparison is to the mean or median expression level of each the 
marker genes in the pool of samples. Such a comparison may be accomplished, for example, 
by dividing by the mean or median expression level of the pool for each of the markers from 
the expression level each of the markers in the sample. This has the effect of accentuating the 
relative differences in expression between markers in the sample and markers in the pool as a 

10 whole, making comparisons more sensitive and more likely to produce meaningful results 
that the use of absolute expression levels alone. The expression level data may be 
transformed in any convenient way; preferably, the expression level data for all is log 
transformed before means or medians are taken. 

In performing comparisons to a pool, two approaches may be used. First, the 
1 5 expression levels of the markers in the sample may be compared to the expression level of 
those markers in the pool, where nucleic acid derived from the sample and nucleic acid 
derived from the pool are hybridized during the course of a single experiment. Such an 
approach requires that new pool nucleic acid be generated for each comparison or limited 
numbers of comparisons, and is therefore limited by the amount of nucleic acid available. 
20 Alternatively, and preferably, the expression levels in a pool, whether normalized and/or 
transformed or not, are stored on a computer, or on computer-readable media, to be used in 
comparisons to the individual expression level data from the sample (i.e., single-channel 
data). 

The current invention also provides the following method of classifying a first cell or 
25 organism as having one of at least two different phenotypes, where the different phenotypes 
comprise a first phenotype and a second phenotype. The level of expression of each of a 
plurality of markers in a first sample from the first cell or organism is compared to the level 
of expression of each of said markers, respectively, in a pooled sample from a plurality of 
cells or organisms, the plurality of cells or organisms comprising different cells or organisms 
30 exhibiting said at least two different phenotypes, respectively, to produce a first compared 
value. The first compared value is then compared to a second compared value, wherein said 
second compared value is the product of a method comprising comparing the level of 
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expression of each of said markers in a sample from a cell or organism characterized as 
having said first phenotype to the level of expression of each of said markers, respectively, in 
the pooled sample. The first compared value is then compared to a third compared value, 
wherein said third compared value is the product of a method comprising comparing the level 

5 of expression of each of the markers in a sample from a cell or organism characterized as 
having the second phenotype to the level of expression of each of the markers, respectively, 
in the pooled sample. In specific embodiments, the marker can be a gene, a protein encoded 
by the gene, etc. Optionally, the first compared value can be compared to additional 
compared values, respectively, where each additional compared value is the product of a 

10 method comprising comparing the level of expression of each of said markers in a sample 
from a cell or organism characterized as having a phenotype different from said first and 
second phenotypes but included among the at least two different phenotypes, to the level of 
expression of each of said genes, respectively, in said pooled sample. Finally, a 
determination is made as to which of said second, third, and, if present, one or more 

1 5 additional compared values, said first compared value is most similar, wherein the first cell or 
organism is determined to have the phenotype of the cell or organism used to produce said 
compared value most similar to said first compared value. 

The sensitivity of a marker-based assay will also be increased if the expression levels 
of individual markers are compared to the expression of the same markers in a control 

20 sample, e.g., a sample comprises a pool of samples, to generate a differential expression 
profile. Such a comparison may be accomplished, for example, by deterrriining a ratio 
between expression level of each marker in the sample and the expression level of the 
corresponding marker in the control sample. This has the effect of accentuating the relative 
differences in expression between markers in the sample and markers in the control as a 

25 whole, making subsequent comparisons to a template more sensitive and more likely to 

produce meaningful results than the use of absolute expression levels alone. The comparison 
may be performed in any convenient way, e.g., by taking difference, ratio, or log(ratio). 

In performing comparisons to a control sample, two approaches may be used. First, 
the expression levels of the markers in the sample may be compared to the expression level of 
30 those markers in the control sample, e.g., where nucleic acid derived from the sample and 
nucleic acid derived from the control are hybridized during the course of a single experiment. 
Such an approach requires that new control sample of nucleic acid be generated for each 
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comparison or limited numbers of comparisons, and is therefore limited by the amount of 
nucleic acid available. Alternatively, the expression levels in a control sample, whether 
normalized and/or transformed or not, are stored on a computer, or on computer-readable 



5 single-channel data). 

The methods of the invention preferably use a control or reference sample, which can 
be any suitable sample against which changes in cellular constituents can be determined. In 
one embodiment, the control or reference sample is generated by pooling together the 
plurality of cellular constituents, e.g., a plurality of transcripts or cDNAs, or a plurality of 

1 0 protein species, from a plurality of breast cancer patients. Alternatively, the control or 
reference sample can be generated by pooling together purified or synthesized cellular 
constituents, e.g., a plurality of purified or synthesized transcripts or cDNAs, a plurality of 
purified or synthesized protein species. In one embodiment, synthetic RNAs for each 
transcripts or cDNAs are pooled to form the control or reference sample. Preferably, the 

1 5 abundances of synthetic RNAs are approximately the abundances of the corresponding 
transcripts in a real tumor pool. The differential expression of marker genes for each 
individual patient sample is measured against this control sample. In one embodiment, 60- 
mer oligonucleotides corresponding to the probe sequences on a microarray used to assay the 
expression levels of the diagnostic/prognostic transcripts are synthesized and cloned into 

20 pBluescript SK- vector (Statagene, La Jolla, CA), adjacent to the T7 promoter sequence. 
Individual clones are isolated, and the sequences of their inserts are verified by DNA 
sequencing. To generate synthetic RNAs, clones are linearized with £coRI and a T7 in vitro 
transcription (TVT) reaction is performed by MegaScript kit (Ambion, Austin, TX), followed 
by DNase treatment of the product. Synthetic RNAs are purified on RNeasy columns 

25 (Qiagen, Valencia, CA). These synthetic RNAs are transcribed, amplified, labeled, and 
mixed together to make the reference pool. The abundance of those synthetic RNAs are 
chosen to approximate the abundances of the transcripts of the corresponding marker genes in 
the real tumor pool. 

The current invention provides the following method of classifying a first cell or 
30 organism as having one of at least two different phenotypes, where the different phenotypes 
comprise a first phenotype and a second phenotype. The level of expression of each of a 
plurality of markers in a first sample from the first cell or organism is compared to the level 



media, to be used in comparisons to the individual expression level data from the sample (i.e., 
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of expression of each of said markers, respectively, in a pooled sample from a plurality of 
cells or organisms, the plurality of cells or organisms comprising different cells or organisms 
exhibiting said at least two different phenotypes, respectively, to produce a first compared 
value so that a first differential profile comprising a plurality of first compared values for said 
5 plurality of markers is generated. The first differential profile is then compared to a second 
differential profile comprising a plurality of second compared values, wherein each said 
second compared value is the product of a method comprising comparing the level of 
expression of each of said markers in a sample from a cell or organism characterized as 
having said first phenotype to the level of expression of each of said markers, respectively, in 

1 0 the pooled sample. The first differential profile is then compared to a third differential profile 
comprising a plurality of third compared values, wherein each said third compared value is 
the product of a method comprising comparing the level of expression of each of the markers 
in a sample from a cell or organism characterized as having the second phenotype to the level 
of expression of each of the markers, respectively, in the pooled sample. In specific 

1 5 embodiments, each marker can be a gene, a protein encoded by the gene, etc. Optionally, the 
first differential profile can be compared to additional expression profiles each of which 
comprising additional compared values, respectively, where each additional compared value 
is the product of a method comprising comparing the level of expression of each of said 
markers in a sample from a cell or organism characterized as having a phenotype different 

20 from said first and second phenotypes but included among the at least two different 

phenotypes, to the level of expression of each of said genes, respectively, in said pooled 
sample. Finally, a determination is made as to which of said second, third, and, if present, 
one or more additional differential profiles, said first differential profile is most similar, 
wherein the first cell or organism is determined to have the phenotype of the cell or organism 

25 used to produce said differential profile most similar to said first differential profile. 

In a specific embodiment of this method, the compared values are each ratios of the 
levels of expression of each of said genes. In another specific embodiment, each of the levels 
of expression of each of the genes in the pooled sample is normalized prior to any of the 
comparing steps. In a more specific embodiment, the normalization of the levels of 
30 expression is carried out by dividing by the median or mean level of the expression of each of 
the genes or dividing by the mean or median level of expression of one or more housekeeping 
genes in the pooled sample from said cell or organism. In another specific embodiment, the 
normalized levels of expression are subjected to a log transform, and the comparing steps 
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comprise subtracting the log transform from the log of the levels of expression of each of the 
genes in the sample. In another specific embodiment, the two or more different phenotypes 
are different stages of a disease or disorder. In still another specific embodiment, the two or 
more different phenotypes are different prognoses of a disease or disorder. In yet another 
5 specific embodiment, the levels of expression of each of the genes, respectively, in the pooled 
sample or said levels of expression of each of said genes in a sample from the cell or 
organism characterized as having the first phenotype, second phenotype, or said phenotype 
different from said first and second phenotypes, respectively, are stored on a computer or on 
a computer-readable medium. 

10 In another specific embodiment, the two phenotypes are good prognosis and poor 

prognosis. In a more specific embodiment, the two phenotypes are good prognosis and poor 
prognosis for an individual that is identified as having ER~ BRCA1 status, ER , sporadic 
status, ER+, ER/AGE high status, ER+, ER/AGE low, LN+ status, or ER+, ER/AGE low, 
LN+ status. 

1 5 In another specific embodiment, the comparison is made between the expression 

profile of the genes in the sample and the expression profile of the same genes in a pool 
representing only one of two or more phenotypes. In the context of prognosis-correlated 
genes, for example, one can compare the expression levels of prognosis-related genes in a 
sample to the average levels of the expression of the same genes in a plurality of "good 

20 prognosis" samples (as opposed to a plurality of samples that include samples from patients 
having poor prognoses and good prognoses). Thus, in this method, a sample is classified as 
having a good prognosis if the expression profile of prognosis-correlated genes exceeds a 
chosen coefficient of correlation to the average "good prognosis" expression profile {e.g., the 
profile comprising average levels of expression of prognosis-correlated genes in samples 

25 from a plurality of patients having a "good prognosis"). Patients whose expression profiles 
correlate more poorly with the "good prognosis" expression profile (e.g., whose correlation 
coefficient fails to exceed the chosen coefficient) are classified as having a poor prognosis. 

Where individuals are classified on the basis of phenotypic, genotypic, or clinical 
characteristics into patient subsets, the pool of samples may be a pool of samples for the 
30 phenotype that includes samples representing each of the patient subsets. Alternatively, the 
pool of samples may be a pool of samples for the phenotype representing only the specific 
patient subset. For example, where an individual is classified as ER+, sporadic, the pool of 
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samples to which the individual's sample is compared may be a pool of samples from ER+, 
sporadic individuals having a good prognosis only, or may be a pool of samples of 
individuals having a good prognosis, without regard to ER status or mutation type. 

The method can be applied to a plurality of patient subsets. For example, in a specific 
5 embodiment, the phenotype is good prognosis, and the individual is classified into one of the 
following patient subsets: ER - , BRCA1 status, ER~, sporadic status, ER+, ER/AGE high 
status, ER+, ER/AGE low, LN+ status, or ER+, ER/AGE low, LN+ status. A set of markers 
informative for prognosis for the patient subset into which the individual is classified is then 
used to determine the likely prognosis for the individual. A sample is classified as coming 
10 from an individual having a good prognosis if the expression profile of prognosis-correlated 
genes for the particular subset into which the individual is classified exceeds a chosen 
coefficient of correlation to the average "good prognosis" expression profile {e.g., the levels 
of expression of prognosis-correlated genes in a plurality of samples from patients within the 
subclass having a "good prognosis"). Patients whose expression levels correlate more poorly 
1 5 with the "good prognosis" expression profile (e.g., whose correlation coefficient fails to 
exceed the chosen coefficient) are classified as having a poor prognosis. 

Of course, single-channel data may also be used without specific comparison to a 
mathematical sample pool. For example, a sample may be classified as having a first or a 
second phenotype, wherein the first and second phenotypes are related, by calculating the 

20 similarity between the expression profile of at least 5 markers in the sample, where the 

markers are correlated with the first or second phenotype, to a first phenotype template and a 
second phenotype template each comprising the expression levels of the same markers, by (a) 
labeling nucleic acids derived from a sample with a fluorophore to obtain a pool of 
fluorophore-labeled nucleic acids; (b) contacting said fluorophore-labeled nucleic acid with a 

25 microarray under conditions such that hybridization can occur, detecting at each of a plurality 
of discrete loci on the microarray a fluorescent emission signal from said fluorophore-labeled 
nucleic acid that is bound to said microarray under said conditions; and (c) determining the 
similarity of marker gene expression in the individual sample to the first and second 
templates, wherein if said expression is more similar to the first template, the sample is 

30 classified as having the first phenotype, and if said expression is more similar to the second 
template, the sample is classified as having the second phenotype. 
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In a specific embodiment of the above method, the first phenotype is a good prognosis 
of breast cancer, the sample is a sample from an individual that has been classified into a 
patient subset, and the first and second templates are templates for the phenotype for the 
particular patient subset. In a more specific embodiment, for example, the first phenotype is 

5 a good prognosis, the second phenotype is a poor prognosis, the patient is classified into an 
ER~, sporadic patient subset, an ER~ BRCA1 subset, an ER+, ER/AGE high subset, an ER+, 
ER/AGE low, LN+ subset, or an ER+, ER/AGE low, LN+ subset, and said first and second 
templates are templates derived from the expression of the marker genes in individuals 
having a good prognosis and a poor prognosis, respectively, wherein said individuals are all 

10 of the patient subset into which said patient is classified. 

5.2.4. METHODS FOR CLASSIFICATION OF EXPR ESSION PROFILES 
In preferred embodiments, the methods of the invention use a classifier for predicting 
prognosis in a patient. The classifier can be based on any appropriate pattern recognition 
method that receives an input comprising a marker profile and provides an output comprising 
15 data indicating which patient subset the patient belongs. The classifier can be trained with 
training data from a training population of breast cancer patients. Typically, the training data 
comprise for each of the patients in the training population a training marker profile 
comprising measurements of respective gene products of a plurality of genes in a suitable 
sample taken from the patient and prognosis information. 

20 In preferred embodiments, the classifier can be based on a classification (pattern 

recognition) method described below, e.g., profile similarity (Section 5.2.4.1., infra); 
artificial neural network (Section 5.2.4.2., infra); support vector machine (SVM, Section 
5.2.4.3., infra); logic regression (Section 5.2.4.4., infra), linear or quadratic discriminant 
analysis (Section 5.2.4.5., infra), decision trees (Section 5.2.4.6., infra), clustering (Section 

25 5.2.4.7., infra), principal component analysis (Section 5.2.4.8., infra), nearest neighbor 

classifier analysis (Section 5.2.4.9., infra). Such classifiers can be trained with the training 
population using methods described in the relevant sections, infra. 

The marker profile can be obtained by measuring the plurality of gene products in a 
cell sample from the patient using a method known in the art, e.g., a method described in 
30 Section 5.3., infra. 
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Various known statistical pattern recognition methods can be used in conjunction with 
the present invention. A classifier based on any of such methods can be constructed using the 
marker profiles and prognosis data of training patients. Such a classifier can then be used to 
evaluate the prognosis status of a patient based on the patient's marker profile. The methods 
5 can also be used to identify markers that discriminate between different prognosis status 
using a marker profile and prognosis data of training patients. 

5.2.4.1. PROFILE MATCHING 

A patient's prognosis can be evaluated by comparing a marker profile obtained in a 
suitable sample from the patient with a marker profile that is representative of a particular 

10 prognosis. Such a marker profile is also termed a "template profile" or a "template." The 
degree of similarity to such a template profile provides an evaluation of the patient's 
prognosis. If the degree of similarity of the patient marker profile and a template profile is 
above a predetermined threshold, the patient is assigned the prognosis represented by the 
template. For example, a patient's prognosis can be evaluated by comparing a marker profile 

15 of the patient to a predetermined template profile corresponding to a given prognosis, e.g., a 
good prognosis template comprising measurements of the plurality of markers which are 
representative of levels of the markers in a plurality of patients having a good prognosis. 

In one embodiment, the similarity is represented by a correlation coefficient between 
the patient's profile and the template. In one embodiment, a correlation coefficient above a 
20 correlation threshold indicates a high similarity, whereas a correlation coefficient below the ' 
threshold indicates a low similarity. 

In a specific embodiment, P t measures the similarity between the patient's profile y 
and a template profile comprising measurements of marker gene products representative of 
measurements of marker gene products in patients having a particular prognosis, e.g., the 
25 good prognosis template z, or the poor prognosis template z 2 . Such a coefficient, P h can be 
calculated using the following equation: 

where i designates the ith template. Thus, in one embodiment, y is classified as a good 
prognosis profile if P, is greater than a selected correlation threshold. In another 
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embodiment, y is classified as poor prognosis profile if P 2 is greater than a selected 
correlation threshold. In preferred embodiments, the correlation threshold is set as 0.3, 0.4, 
0.5 or 0.6. In another embodiment, y is classified as a good prognosis profile if P, is greater 
than P 2 , whereas y is classified as a poor prognosis profile if P, is less than P 2 . 

In another embodiment, the correlation coefficient is a weighted dot product of the 
patient's profile y and a template profile, in which measurements of each different marker is 
assigned a weight. 

In another embodiment, similarity between a patient's profile and a template is 
represented by a distance between the patient's profile and the template. In one embodiment, 
a distance below a given value indicates high similarity, whereas a distance equal to or 
greater than the given value indicates low similarity. 

In one embodiment, the Euclidian distance according to the formula 

D,=\\y-m 

is used, where D t measures the distance between the patient's profile y and a template 
profile comprising measurements of marker gene products representative of measurements of 
marker gene products in patients having a particular prognosis, e.g., the good prognosis 
template z, or the poor prognosis template z 2 . In other embodiments, the Euclidian distance 
is squared to place progressively greater weight on cellular constituents that are further apart. 
In alternative embodiments, the distance measure D, is the Manhattan distance provide by 

A=2>(")-*,(»)| 

where y(n) and z t (n) are respectively measurements of the nth marker gene product in the 
patient's profile y and a template profile. 

In another embodiment, the distance is defined as Z>. = 1 - i> , where P, is the 
correlation coefficient or normalized dot product as described above. 

In still other embodiments, the distance measure may be the Chebychev distance, the 
power distance, and percent disagreement, all of which are well known in the art. 
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5.2.4.2. ARTIFICIAL NEURAL NETWORK 

In some embodiments, a neural network is used. A neural network can be 
constructed for a selected set of molecular markers of the invention. A neural network is a 
two-stage regression or classification model. A neural network has a layered structure that 
5 includes a layer of input units (and the bias) connected by a layer of weights to a layer of 
output units. For regression, the layer of output units typically includes just one output unit. 
However, neural networks can handle multiple quantitative responses in a seamless fashion. 

In multilayer neural networks, there are input units (input layer), hidden units (hidden 
layer), and output units (output layer). There is, furthermore, a single bias unit that is 
1 0 connected to each unit other than the input units. Neural networks are described in Duda et 
al, 2001, Pattern Classification, Second Edition, John Wiley & Sons, Inc., New York; and 
Hastie et al. , 2001 , The Elements of Statistical Learning, Springer-Verlag, New York. 

The basic approach to the use of neural networks is to start with an untrained network, 
present a training pattern, e.g., marker profiles from training patients, to the input layer, and 

1 5 to pass signals through the net and determine the output, e.g., the prognosis in the training 
patients, at the output layer. These outputs are then compared to the target values; any 
difference corresponds to an error. This error or criterion function is some scalar function of 
the weights and is minimized when the network outputs match the desired outputs. Thus, the 
weights are adjusted to reduce this measure of error. For regression, this error can be sum-of- 

20 squared errors. For classification, this error can be either squared error or cross-entropy 
(deviation). See, e.g., Hastie et al., 2001, The Elements of Statistical Learning, Springer- 
Verlag, New York. 

Three commonly used training protocols are stochastic, batch, and on-line. In 
stochastic training, patterns are chosen randomly from the training set and the network 

25 weights are updated for each pattern presentation. Multilayer nonlinear networks trained by 
gradient descent methods such as stochastic back-propagation perform a maximum- 
likelihood estimation of the weight values in the model defined by the network topology. In 
batch training, all patterns are presented to the network before learning takes place. 
Typically, in batch training, several passes are made through the training data. In online 

30 training, each pattern is presented once and only once to the net. 
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In some embodiments, consideration is given to starting values for weights. If the 
weights are near zero, then the operative part of the sigmoid commonly used in the hidden 
layer of a neural network (see, e.g., Hastie etal, 2001, The Elements of Statistical Learning, 
Springer- Verlag, New York) is roughly linear, and hence the neural network collapses into an 
5 approximately linear model. In some embodiments, starting values for weights are chosen to 
be random values near zero. Hence the model starts out nearly linear, and becomes nonlinear 
as the weights increase. Individual units localize to directions and introduce nonlinearities 
where needed. Use of exact zero weights leads to zero derivatives and perfect symmetry, and 
the algorithm never moves. Alternatively, starting with large weights often leads to poor 
10 solutions. 

Since the scaling of inputs determines the effective scaling of weights in the bottom 
layer, it can have a large effect on the quality of the final solution. Thus, in some 
embodiments, at the outset all expression values are standardized to have mean zero and a 
standard deviation of one. This ensures all inputs are treated equally in the regularization 
1 5 process, and allows one to choose a meaningful range for the random starting weights. With 
standardization inputs, it is typical to take random uniform weights over the range [-0.7, 
+0.7]. 

A recurrent problem in the use of networks having a hidden layer is the optimal 
number of hidden units to use in the network. The number of inputs and outputs of a network 

20 are determined by the problem to be solved. In the present invention, the number of inputs 
for a given neural network can be the number of molecular markers in the selected set of 
molecular markers of the invention. The number of output for the neural network will 
typically be just one. However, in some embodiment more than one output is used so that 
more than just two states can be defined by the network. If too many hidden units are used in 

25 a neural network, the network will have too many degrees of freedom and is trained too long, 
there is a danger that the network will overfit the data If there are too few hidden units, the 
training set cannot be learned. Generally speaking, however, it is better to have too many 
hidden units than too few. With too few hidden units, the model might not have enough 
flexibility to capture the nonlinearities in the data; with too many hidden units, the extra 

30 weight can be shrunk towards zero if appropriate regularization or pruning, as described 
below, is used. In typical embodiments, the number of hidden units is somewhere in the 
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range of 5 to 100, with the number increasing with the number of inputs and number of 
training cases. 

One general approach to determining the number of hidden units to use is to apply a 
regularization approach. In the regularization approach, a new criterion function is 
5 constructed that depends not only on the classical training error, but also on classifier 
complexity. Specifically, the new criterion function penalizes highly complex models; 
searching for the minimum in this criterion is to balance error on the training set with error on 
the training set plus a regularization term, which expresses constraints or desirable properties 
of solutions: 

10 J = Jpa, + U reg . 

The parameter X is adjusted to impose the regularization more or less strongly. In other 
words, larger values for X will tend to shrink weights towards zero: typically cross-validation 
with a validation set is used to estimate X. This validation set can be obtained by setting aside 
a random subset of the training population. Other forms of penalty can also be used, for 
1 5 example the weight elimination penalty (see, e.g. , Hastie et al. , 2001 , The Elements of 
Statistical Learning, Springer-Verlag, New York). 

Another approach to determine the number of hidden units to use is to eliminate - 
prune - weights that are least needed. In one approach, the weights with the smallest 
magnitude are eliminated (set to zero). Such magnitude-based pruning can work, but is 

20 nonoptimal; sometimes weights with small magnitudes are important for learning and 

training data. In some embodiments, rather than using a magnitude-based priming approach, 
Wald statistics are computed. The fundamental idea in Wald Statistics is that they can be 
used to estimate the importance of a hidden unit (weight) in a model. Then, hidden units 
having the least importance are eliminated (by setting their input and output weights to zero). 

25 Two algorithms in this regard are the Optimal Brain Damage (OBD) and the Optimal Brain 
Surgeon (OBS) algorithms that use second-order approximation to predict how the training 
error depends upon a weight, and eliminate the weight that leads to the smallest increase in 
training error. 

Optimal Brain Damage and Optimal Brain Surgeon share the same basic approach of 
30 training a network to local minimum error at weight w, and then pruning a weight that leads 
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to the smallest increase in the training error. The predicted functional increase in the error for 
a change in full weight vector 5w is: 



^g'.* + I*,.g.a, +0 t*f) 



d J 

where — T is the Hessian matrix. The first term vanishes because we are at a local 

dw 

5 minimum in error; third and higher order terms are ignored. The general solution for 
minimizing this function given the constraint of deleting one weight is: 



Here, u q is the unit vector along the qth direction in weight space and L q is approximation to 
the saliency of the weight q - the increase in training error if weight q is pruned and the other 
weights updated 8w. These equations require the inverse of H. One method to calculate this 
inverse matrix is to start with a small value, 1 = a\ where a is a small parameter - 
effectively a weight constant. Next the matrix is updated with each pattern according to 

TT-l TT-1 H m 1 X_ +1 XL..H ' 



where the subscripts correspond to the pattern being presented and a„, decreases with m. 
1 5 After the full training set has been presented, the inverse Hessian matrix is given by H" 1 = 
H~ l . In algorithmic form, the Optimal Brain Surgeon method is: 

begin initialize w, 0 

train a reasonably large network to minimum error 

do compute H' 1 by Eqn. 1 

20 q* <- arg min w] /^j/T 1 ^) (saliency Lq) 
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w <- w- i — — H^e^ (saliencyL q ) 

until J(w) > 6 
return w 
end 

5 The Optimal Brain Damage method is computationally simpler because the 

calculation of the inverse Hessian matrix in line 3 is particularly simple for a diagonal matrix. 
The above algorithm terminates when the error is greater than a criterion initialized to be 8. 
Another approach is to change line 6 to terminate when the change in J(w) due to elimination 
of a weight is greater than some criterion value. 

10 In some embodiments, a back-propagation neural network (see, for example Abdi, 

1994, "A neural network primer", J. Biol System. 2, 247-283) containing a single hidden 
layer of ten neurons (ten hidden units) found in EasyNN-Plus version 4.0g software package 
(Neural Planner Software Inc.) is used. In a specific example, parameter values within the 
EasyNN-Plus program are set as follows: a learning rate of 0.05, and a momentum of 0.2. In 

1 5 some embodiments in which the EasyNN-Plus version 4.0g software package is used, 

"outlier" samples are identified by performing twenty independently-seeded trials involving 
20,000 learning cycles each. 

5.2.4.3. SUPPORT VECTOR MACHINE 
In some embodiments of the present invention, support vector machines (SVMs) are 

20 used to classify subjects using expression profiles of marker genes described in the present 
invention. General description of SVM can be found in, for example, Cristianini and Shawe- 
Taylor, 2000, An Introduction to Support Vector Machines, Cambridge University Press, 
Cambridge, Boser et al, 1992, "A training algorithm for optimal margin classifiers, in 
Proceedings of the 5 th Annual ACM Workshop on Computational Learning Theory, ACM 

25 Press, Pittsburgh, PA, pp. 142-152; Vapnik, 1998, Statistical Learning Theory, Wiley, New 
York; Duda, Pattern Classification, Second Edition, 2001, John Wiley & Sons, Inc.; Hastie, 
2001, The Elements of Statistical Learning, Springer, New York; and Furey et al, 2000, 
Bioinformatics 16, 906-914. Applications of SVM in biological applications are described in 
Jaakkola et al., Proceedings of the 7 th International Conference on Intelligent Systems for 
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Molecular Biology, AAAI Press, Menlo Park, CA (1999); Brown et al, Proc. Natl. Acad. Sci. 
97(l):262-67 (2000); Ziene/a/., Bioinformatics, 16(9):799-807 (2000); Furey etal, 
Bioinformatics, 16(10):906-914 (2000) 

In one approach, when a SVM is used, the gene expression data is standardized to 
5 have mean zero and unit variance and the members of a training population are randomly 
divided into a training set and a test set. For example, in one embodiment, two thirds of the 
members of the training population are placed in the training set and one third of the 
members of the training population are placed in the test set. The expression values for a 
selected set of genes of the present invention is used to train the SVM. Then the ability for 
10 the trained SVM to correctly classify members in the test set is determined. In some 
embodiments, this computation is performed several times for a given selected set of 
molecular markers. In each iteration of the computation, the members of the training 
population are randomly assigned to the training set and the test set. Then, the quality of the 
combination of molecular markers is taken as the average of each such iteration of the SVM 
15 computation. 

Support vector machines map a given set of binary labeled training data to a high- 
dimensional feature space and separate the two classes of data with a maximum margin 
hyperplane. In general, this hyperplane corresponds to a nonlinear decision boundary in the 
input space. Let X e R 0 c 9T be the input vectors, y e {-1,+1} be the labels, and 
20 <j> : R 0 -» F be the mapping from input space to feature space. Then the SVM learning 
algorithm finds a hyperplane (w,b) such that the quantity 



is maximized, where the vector w has the same dimensionality as F, b is a real number, and y 
is called the margin. The corresponding decision function is then 



r = rmny l {(w,MX i ))-b} 



25 



f(X) = sign((w,^X))-b) 



This minimum occurs when 



Ml 
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where {a?} are positive real numbers that maximize 

i ij 

subject to 

^ v, =(),«, >0 

5 The decision function can equivalently be expressed as 

/(X) = sign(Za l y l (KX l ,4(R)-b) 

From this equation it can be seen that the a, associated with the training point Xj 
expresses the strength with which that point is embedded in the final decision function. A 
remarkable property of this alternative representation is that only a subset of the points will 
10 be associated with a non-zero a,. These points are called support vectors and are the points 
that lie closest to the separating hyperplane. The sparseness of the a vector has several 
computational and learning theoretic consequences. It is important to note that neither the 
learning algorithm nor the decision function needs to represent explicitly the image of points 
in the feature space, #X,) , since both use only the dot products between such images, 
1 5 {tfX,), <f>(Xj)) . Hence, if one were given a function K(X,Y) = (0(X), ftX)) , one could 
learn and use the maximum margin hyperplane in the feature space without ever explicitly 
performing the mapping. For each continuous positive definite function K(X,Y) there exists 
a mapping ^suchthat K(X,Y) = (^(X),^(X)) for all X,YeR 0 (Mercer's Theorem). The 
function K(X,Y) is called the kernel Junction. The use of a kernel function allows the 
20 support vector machine to operate efficiently in a nonlinear high-dimensional feature spaces 
without being adversely affected by the dimensionality of that space. Indeed, it is possible to 
work with feature spaces of infinite dimension. Moreover, Mercer's theorem makes it 
possible to learn in the feature space without even knowing i> and F. The matrix 
K 0 =(#X,),0(X,)) is called the kernel matrix. Finally, note that the learning algorithm is a 
25 quadratic optimization problem that has only a global optimum. The absence of local minima 
is a significant difference from standard pattern recognition techniques such as neural 
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networks. For moderate sample sizes, the optimization problem can be solved with simple 
gradient descent techniques. In the presence of noise, the standard maximum margin 
algorithm described above can be subject to overfitting, and more sophisticated techniques 
should be used. This problem arises because the maximum margin algorithm always finds a 
5 perfectly consistent hypothesis and does not tolerate training error. Sometimes, however, it is 
necessary to trade some training accuracy for better predictive power. The need for tolerating 
training error has led to the development the soft-margin and the margin-distribution 
classifiers. One of these techniques replaces the kernel matrix in the training phase as 
follows: 



while still using the standard kernel function in the decision phase. By tuning X, one can 
control the training error, and it is possible to prove that the risk of misclassifying unseen 
points can be decreased with a suitable choice of A,. 

If instead of controlling the overall training error one wants to control the trade-off 
1 5 between false positives and false negatives, it is possible to modify K as follows: 



where D is a diagonal matrix whose entries are either d + or d~ , in locations corresponding to 
positive and negative examples. It is possible to prove that this technique is equivalent to 
controlling the size of the a, in a way that depends on the size of the class, introducing a bias 

20 for larger a t in the class with smaller d. This in turn corresponds to an asymmetric margin; 
i.e., the class with smaller d will be kept further away from the decision boundary. In some 
cases, the extreme imbalance of the two classes, along with the presence of noise, creates a 
situation in which points from the minority class can be easily mistaken for mislabelled 
points. Enforcing a strong bias against training errors in the minority class provides 

25 protection agaist such errors and forces the SVM to make the positive examples support 

vectors. Thus, choosing d + =-|- and d~ =— provides a heuristic way to automatically 
n n 

adjust the relative importance of the two classes, based on their respective cardinalities. This 
technique effectively controls the trade-off between sensitivity and specificity. 



10 



K+-K+AD 
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In the present invention, a linear kernel can be used. The similarity between two 
marker profiles X and Y can be the dot product X-Y. In one embodiment, the kernel is 

£(X,Y) = X-Y+1 
In another embodiment, a kernel of degree d is used 

K(X, Y) = (X-Y + l) d , where d can be either 2,3,... 
In still another embodiment, a Gaussian kernel is used 



K(X,Y) = exp(- 




where a is the width of the Gaussian. 

5.2.4.4. LOGISTIC REGRESSION 

In some embodiments, the classifier is based on a regression model, preferably a 
logistic regression model. Such a regression model includes a coefficient for each of the 
molecular markers in a selected set of molecular markers of the invention. In such 
embodiments, the coefficients for the regression model are computed using, for example, a 
maximum likelihood approach. In particular embodiments, molecular marker data from two 
different clinical groups, e.g., good or poor prognosis, is used and the dependent variable is 
the clinical status of the patient for which molecular marker characteristic data are from. 

Some embodiments of the present invention provide generalizations of the logistic 
regression model that handle multicategory (polychotomous) responses. Such embodiments 
can be used to discriminate an organism into one or three or more clinical groups, e.g., good, 
intermediate, and poor prognosis. Such regression models use multicategory logit models 
that simultaneously refer to all pairs of categories, and describe the odds of response in one 
category instead of another. Once the model specifies logits for a certain (J-l) pairs of 
categories, the rest are redundant. See, for example, Agresti, An Introduction to Categorical 
Data Analysis, John Wiley & Sons, Inc., 1996, New York, Chapter 8, which is hereby 
incorporated by reference. 

5.2.4.5. DISCRIMINANT ANALYSIS 
Linear discriminant analysis (LDA) attempts to classify a subject into one of two 
categories based on certain object properties. In other words, LDA tests whether object 
attributes measured in an experiment predict categorization of the objects. LDA typically 
requires continuous independent variables and a dichotomous categorical dependent variable. 
In the present invention, the expression values for the selected set of molecular markers of the 
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invention across a subset of the training population serve as the requisite continuous 
independent variables. The clinical group classification of each of the members of the 
training population serves as the dichotomous categorical dependent variable. 

LDA seeks the linear combination of variables that maximizes the ratio of between- 
5 group variance and within-group variance by using the grouping information. Implicitly, the 
linear weights used by LDA depend on how the expression of a molecular marker across the 
training set separates in the two groups {e.g., a group that has good prognosis and a group that 
have poor prognosis) and how this gene expression correlates with the expression of other 
genes. In some embodiments, LDA is applied to the data matrix of the N members in the 

10 training sample by K genes in a combination of genes described in the present invention. 
Then, the linear discriminant of each member of the training population is plotted. Ideally, 
those members of the training population representing a first subgroup {e.g. those subjects 
that good prognosis) will cluster into one range of linear discriminant values {e.g., negative) 
and those member of the training population representing a second subgroup {e.g. those 

1 5 subjects that have poor prognosis) will cluster into a second range of linear discriminant 

values {e.g., positive). The LDA is considered more successful when the separation between 
the clusters of discriminant values is larger. For more information on linear discriminant 
analysis, see Duda, Pattern Classification, Second Edition, 2001, John Wiley & Sons, Inc; 
and Hastie, 2001, The Elements of Statistical Learning, Springer, New York; Venables & 

20 Ripley, 1 997, Modern Applied Statistics with s-plus, Springer, New York. 

Quadratic discriminant analysis (QDA) takes the same input parameters and returns 
the same results as LDA. QDA uses quadratic equations, rather than linear equations, to 
produce results. LDA and QDA are interchangeable, and which to use is a matter of 
preference and/or availability of software to support the analysis. Logistic regression takes 
25 the same input parameters and returns the same results as LDA and QDA. 



In some embodiments of the present invention, decision trees are used to classify 
patients using expression data for a selected set of molecular markers of the invention. 
Decision tree algorithms belong to the class of supervised learning algorithms. The aim of a 
30 decision tree is to induce a classifier (a tree) from real-world example data. This tree can be 
used to classify unseen examples which have not been used to derive the decision tree. 



5.2.4.6. DECISION TREES 
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A decision tree is derived from training data. An example contains values for the 
different attributes and what class the example belongs. In one embodiment, the training data 
is expression data for a combination of genes described in the present invention across the 
training population. 

5 The following algorithm describes a decision tree derivation: 

Tree(Examples,Class,Attributes) 

Create a root node 

If all Examples have the same Class value, give the root this label 
Else if Attributes is empty label the root according to the most common value 
10 Else begin 

Calculate the information gain for each attribute 

Select the attribute A with highest information gain and make this the root 

attribute 

For each possible value, v, of this attribute 
1 5 Add a new branch below the root, corresponding to A = v 

Let Examples(v) be those examples with A = v 

If Examples(v) is empty, make the new branch a leaf node labeled with 
the most common value among Examples 

Else let the new branch be the tree created by 

20 Tree(Examples(v),Class,Attributes - {A}) 

end 
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A more detailed description of the calculation of information gain is shown in the following. 
If the possible classes v s of the examples have probabilities P(vO then the information content 
I of the actual answer is given by: 

I(P(v l ) > ... f P(v I( ))=X-P(v,)log 2 P(v i ) 

i=l 

5 The I- value shows how much information we need in order to be able to describe the 

outcome of a classification for the specific dataset used. Supposing that the dataset contains 
p positive (e.g. has poor prognosis) and n negative (e.g. has good prognosis) examples (e.g. 
individuals), the information contained in a correct answer is: 

) = — — Log 2 — log 2 

p + n p + n p + n p + n p + n p + n 

10 where log 2 is the logarithm using base two. By testing single attributes the amount of 
information needed to make a correct classification can be reduced. The remainder for a 
specific attribute A (e.g. a gene) shows how much the information that is needed can be 
reduced. 

It P + n p,+n t p. + n, 

15 "v" is the number of unique attribute values for attribute A in a certain dataset, "i" is a certain 
attribute value, "ps" is the number of examples for attribute A where the classification is 
positive (e.g. cancer), "n" is the number of examples for attribute A where the classification 
is negative (e.g. healthy). 

The information gain of a specific attribute A is calculated as the difference between 
20 the information content for the classes and the remainder of attribute A: 

Gain(A) = J(-^-, — — ) -Re maindei{A) 
p + n p+n 



-78- 




WO 2006/084272 PCT/US2006/004280 

The information gain is used to evaluate how important the different attributes are for the 
classification (how well they split up the examples), and the attribute with the highest 
information. 

In general there are a number of different decision tree algorithms, many of which are 
5 described in Duda, Pattern Classification, Second Edition, 2001 , John Wiley & Sons, Inc. 
Decision tree algorithms often require consideration of feature processing, impurity measure, 
stopping criterion, and pruning. Specific decision tree algorithms include, cut are not limited 
to classification and regression trees (CART), multivariate decision trees, ID3, and C4.5. 

In one approach, when an exemplary embodiment of a decision tree is used, the gene 
1 0 expression data for a selected set of molecular markers of the invention across a training 

population is standardized to have mean zero and unit variance. The members of the training 
population are randomly divided into a training set and a test set. For example, in one 
embodiment, two thirds of the members of the training population are placed in the training 
set and one third of the members of the training population are placed in the test set. The 
1 5 expression values for a select combination of genes described in the present invention is used 
to construct the decision tree. Then, the ability for the decision tree to correctly classify 
members in the test set is determined. In some embodiments, this computation is performed 
several times for a given combination of molecular markers. In each iteration of the 
computation, the members of the training population are randomly assigned to the training set 
20 and the test set. Then, the quality of the combination of molecular markers is taken as the 
average of each such iteration of the decision tree computation. 

5.2.4.7. CLUSTERING 

In some embodiments, the expression values for a selected set of molecular markers 
of the invention are used to cluster a training set. For example, consider the case in which ten 
25 genes described in the present invention are used. Each member m of the training population 
will have expression values for each of the ten genes. Such values from a member m in the 
training population define the vector: 
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where X ini is the expression level of the 1 th gene in organism m. If there are m organisms in 
the training set, selection of i genes will define m vectors. Note that the methods of the 
present invention do not require that each the expression value of every single gene used in 
5 the vectors be represented in every single vector m. In other words, data from a subject in 
which one of the i" 1 genes is not found can still be used for clustering. In such instances, the 
missing expression value is assigned either a "zero" or some other normalized value. In some 
embodiments, prior to clustering, the gene expression values are normalized to have a mean 
value of zero and unit variance. 

1 0 Those members of the training population that exhibit similar expression patterns 

across the training group will tend to cluster together. A particular combination of genes of 
the present invention is considered to be a good classifier in this aspect of the invention when 
the vectors cluster into the trait groups found in the training population. For instance, if the 
training population includes patients with good or poor prognosis, a clustering classifier will 

1 5 cluster the population into two groups, with each group uniquely representing either a good 
, prognosis or a poor prognosis. 

Clustering is described on pages 21 1-256 of Duda and Hart, Pattern Classification 
and Scene Analysis, 1973, John Wiley & Sons, Inc., New York. As described in Section 6.7 
of Duda, the clustering problem is described as one of finding natural groupings in a dataset. 
20 To identify natural groupings, two issues are addressed. First, a way to measure similarity (or 
dissimilarity) between two samples is determined. This metric (similarity measure) is used to 
ensure that the samples in one cluster are more like one another than they are to samples in 
other clusters. Second, a mechanism for partitioning the data into clusters using the similarity 
measure is determined. 

25 Similarity measures are discussed in Section 6.7 of Duda, where it is stated that one 

way to begin a clustering investigation is to define a distance function and to compute the 
matrix of distances between all pairs of samples in a dataset. If distance is a good measure of 
similarity, then the distance between samples in the same cluster will be significantly less 
than the distance between samples in different clusters. However, as stated on page 215 of 

30 Duda, clustering does not require the use of a distance metric. For example, a nonmetric 

similarity function s(x, x') can be used to compare two vectors x and x'. Conventionally, s(x, 
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x') is a symmetric function whose value is large when x and x' are somehow "similar". An 
example of a nonmetric similarity function s(x, x') is provided on page 216 of Duda. 

Once a method for measuring "similarity" or "dissimilarity" between points in a 
dataset has been selected, clustering requires a criterion function that measures the clustering 
5 quality of any partition of the data. Partitions of the data set that extremize the criterion 
function are used to cluster the data. See page 217 of Duda. Criterion functions are 
discussed in Section 6.8 of Duda. 

More recently, Duda et al, Pattern Classification, 2 nd edition, John Wiley & Sons, 
Inc. New York, has been published. Pages 537-563 describe clustering in detail. More 

1 0 information on clustering techniques can be found in Kaufman and Rousseeuw, 1 990, 

Finding Groups in Data: An Introduction to Cluster Analysis, Wiley, New York, NY; Everitt, 
1993, Cluster analysis (3d ed.), Wiley, New York, NY; and Backer, 1 995, Computer-Assisted 
Reasoning in Cluster Analysis, Prentice Hall, Upper Saddle River, New Jersey. Particular 
exemplary clustering techniques that can be used in the present invention include, but are not 

1 5 limited to, hierarchical clustering (agglomerative clustering using nearest-neighbor algorithm, 
farthest-neighbor algorithm, the average linkage algorithm, the centroid algorithm, or the 
sum-of-squares algorithm), k-means clustering, fuzzy k-means clustering algorithm, and 
Jarvis-Patrick clustering. 



data. Principal component analysis is a classical technique to reduce the dimensionality of a 
data set by transforming the data to a new set of variable (principal components) that 
summarize the features of the data. See, for example, Jolliffe, 1986, Principal Component 
Analysis, Springer, New York. Principal components (PCs) are uncorrelate and are ordered 
25 such that the PC has the kih largest variance among PCs. The A* PC can be interpreted as 
the direction that maximizes the variation of the projections of the data points such that it is 
orthogonal to the first k - 1 PCs. The first few PCs capture most of the variation in the data 
set. In contrast, the last few PCs are often assumed to capture only the residual 'noise' in the 
data. 

30 PC A can also be used to create a classifier in accordance with the present invention. 

In such an approach, vectors for a selected set of molecular markers of the invention can be 



5.2.4.8. PRINCIPAL COMPONENT ANALYSIS 



20 



Principal component analysis (PCA) has been proposed to analyze gene expression 



-81- 



WO 2006/084272 



PCT7US2006/004280 



constructed in the same manner described for clustering above. In fact, the set of vectors, 
where each vector represents the expression values for the select genes from a particular 
member of the training population, can be considered a matrix. In some embodiments, this 
matrix is represented in a Free- Wilson method of qualitative binary description of monomers 
(Kubinyi, 1990, 3D QSAR in drug design theory methods and applications, Pergamon Press, 
Oxford, pp 589-638), and distributed in a maximally compressed space using PCA so that the 
first principal component (PC) captures the largest amount of variance information possible, 
the second principal component (PC) captures the second largest amount of all variance 
information, and so forth until all variance information in the matrix has been accounted for. 

Then, each of the vectors (where each vector represents a member of the training 
population) is plotted. Many different types of plots are possible. In some embodiments, a 
one-dimensional plot is made. In this one-dimensional plot, the value for the first principal 
component from each of the members of the training population is plotted. In this form of 
plot, the expectation is that members of a first group (e.g. good prognosis patients) will 
cluster in one range of first principal component values and members of a second group (e.g., 
poor prognosis patients) will cluster in a second range of first principal component values. 

In one example, the training population comprises two groups: a good prognosis 
group and a poor prognosis group. The first principal component is computed using the 
molecular marker expression values for the select genes of the present invention across the 
entire training population data set. Then, each member of the training set is plotted as a 
function of the value for the first principal component. In this example, those members of the 
framing population in which the first principal component is positive are the good prognosis 
patients and those members of the training population in which the first principal component 
is negative are poor prognosis patients. 

In some embodiments, the members of the training population are plotted against 
more than one principal component. For example, in some embodiments, the members of the 
training population are plotted on a two-dimensional plot in which the first dimension is the 
first principal component and the second dimension is the second principal component. In 
such a two-dimensional plot, the expectation is that members of each subgroup represented in 
the training population will cluster into discrete groups. For example, a first cluster of 
members in the two-dimensional plot will represent subjects with good prognosis, a second 
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cluster of members in the two-dimensional plot will represent subjects with poor prognosis, 
and so forth. 

In some embodiments, the members of the training population are plotted against 
more than two principal components and a determination is made as to whether the members 
of the training population are clustering into groups that each uniquely represents a subgroup 
found in the training population. In some embodiments, principal component analysis is 
performed by using the R mva package (Anderson, 1973, Cluster Analysis for applications, 
Academic Press, New York 1973; Gordon, Classification, Second Edition, Chapman and 
Hall, CRC, 1999.). Principal component analysis is further described in Duda, Pattern 
Classification, Second Edition, 2001, John Wiley & Sons, Inc. 

5.2.4.9. NEAREST NEIGHBOR CLASSIFIER ANALYSTS 
Nearest neighbor classifiers are memory-based and require no model to be fit. Given 
a query point x 0 , the k training points x (r) , r,...,k closest in distance to x 0 are identified and 
then the point x 0 is classified using the k nearest neighbors. Ties can be broken at random. In 
some embodiments, Euclidean distance in feature space is used to determine distance as: 

4o+<0-*„|- 

Typically, when the nearest neighbor algorithm is used, the expression data used to compute 
the linear discriminant is standardized to have mean zero and variance 1. In the present 
invention, the members of the training population are randomly divided into a training set and 
a test set. For example, in one embodiment, two thirds of the members of the training 
population are placed in the training set and one third of the members of the training 
population are placed in the test set. Profiles of a selected set of molecular markers of the 
invention represents the feature space into which members of the test set are plotted. Next, 
the ability of the training set to correctly characterize the members of the test set is computed. 
In some embodiments, nearest neighbor computation is performed several times for a given 
combination of genes of the present invention. In each iteration of the computation, the 
members of the training population are randomly assigned to the training set and the test set. 
Then, the quality of the combination of genes is taken as the average of each such iteration of 
the nearest neighbor computation. 
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The nearest neighbor rule can be refined to deal with issues of unequal class priors, 
differential misclassification costs, and feature selection. Many of these refinements involve 
some form of weighted voting for the neighbors. For more information on nearest neighbor 
analysis, see Duda, Pattern Classification, Second Edition, 2001, John Wiley & Sons, Inc; 
5 and Hastie, 2001 , The Elements of Statistical Learning, Springer, New York. 

5.2.4.10. EVOLUTIONARY METHODS 
Inspired by the process of biological evolution, evolutionary methods of classifier 
design employ a stochastic search for an optimal classifier. In broad overview, such methods 
create several classifiers - a population - from measurements of gene products of the present 

10 invention. Each classifier varies somewhat from the other. Next, the classifiers are scored on 
expression data across the training population. In keeping with the analogy with biological 
evolution, the resulting (scalar) score is sometimes called the fitness. The classifiers are 
ranked according to their score and the best classifiers are retained (some portion of the total 
population of classifiers). Again, in keeping with biological terminology, this is called 

15 survival of the fittest. The classifiers are stochastically altered in the next generation - the 
children or offspring. Some offspring classifiers will have higher scores than their parent in 
the previous generation, some will have lower scores. The overall process is then repeated 
for the subsequent generation: The classifiers are scored and the best ones are retained, 
randomly altered to give yet another generation, and so on. In part, because of the ranking, 

20 each generation has, on average, a slightly higher score than the previous one. The process is 
halted when the single best classifier in a generation has a score that exceeds a desired 
criterion value. More information on evolutionary methods is found in, for example, Duda, 
Pattern Classification, Second Edition, 2001, John Wiley & Sons, Inc. 

5.2.4.11. BAGGING. BOOSTING AND THE RANDOM SUBSPACE METHOD 
25 Bagging, boosting and the random subspace method are combining techniques that 

can be used to improve weak classifiers. These techniques are designed for, and usually 
applied to, decision trees. In addition, Skurichina and Duin provide evidence to suggest that 
such techniques can also be useful in linear discriminant analysis. 

In bagging, one samples the training set, generating random independent bootstrap 
30 replicates, constructs the classifier on each of these, and aggregates them by a simple 

majority vote in the final decision rule. See, for example, Breiman, 1996, Machine Learning 
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24, 123-140; and Efron & Tibshirani, An Introduction to Bootstrap, Chapman & Hall, New 
York, 1993. 

In boosting, classifiers are constructed on weighted versions of the training set, which 
are dependent on previous classification results. Initially, all objects have equal weights, and 
5 the first classifier is constructed on this data set. Then, weights are changed according to the 
performance of the classifier. Erroneously classified objects (molecular markers in the data 
set) get larger weights, and the next classifier is boosted on the reweighted training set. In 
this way, a sequence of training sets and classifiers is obtained, which is then combined by 
simple majority voting or by weighted majority voting in the final decision. See, for 
10 example, Freund & Schapire, "Experiments with a new boosting algorithm," Proceedings 13 th 
International Conference on Machine Learning, 1996, 148-156. 

To illustrate boosting, consider the case where there are two phenotypic groups 
exhibited by the population under study, phenotype 1 (e.g., good prognosis patients), and 
phenotype 2 (e.g., poor prognosis patients). Given a vector of molecular markers X, a 
1 5 classifier G(X) produces a prediction taking one of the type values in the two value set: { 
phenotype 1 , phenotype 2} . The error rate on the training sample is 



where N is the number of subjects in the teaming set (the sum total of the subjects that have 
20 either phenotype 1 or phenotype 2). 

A weak classifier is one whose error rate is only slightly better than random guessing. 
In the boosting algorithm, the weak classification algorithm is repeatedly applied to modified 
versions of the data, thereby producing a sequence of weak classifiers G m (x), m, = I, 2, M. 
The predictions from all of the classifiers in this sequence are then combined through a 
25 weighted majority vote to produce the final prediction: 
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Here <xi, ct 2 &m are computed by the boosting algorithm and their purpose is to weigh the 

contribution of each respective G m (x). Their effect is to give higher influence to the more 
accurate classifiers in the sequence. 



w n to each of the training observations (x h y,), i = 1, 2, N. Initially all the weights are set 
to w, = 1/N, so that the first step simply trains the classifier on the data in the usual manner. 
For each successive iteration m = 2, 3, iWthe observation weights are individually 
modified and the classification algorithm is reapplied to the weighted observations. At stem 
1 0 m, those observations that were misclassified by the classifier G m ./(x) induced at the previous 
step have their weights increased, whereas the weights are decreased for those that were 
classified correctly. Thus as iterations proceed, observations that are difficult to correctly 
classify receive ever-increasing influence. Each successive classifier is thereby forced to 
concentrate on those training observations that are missed by previous ones in the sequence. 

1 5 The exemplary boosting algorithm is summarized as follows: 



5 



The data modifications at each boosting step consist of applying weights wj, w>, .... 



. Initialize the observation weights w, = 1/N, / = 1, 2, N. 



2. For m = 1 to M: 



20 



(a) Fit a classifier G m (x) to the training set using weights w,. 



(b) Compute 



err,„ =- 




(c) Compute a m =log((l-err,„)/err„,). 



(d) Setw, 



■w, -expK -I(y t * G.(x,))],i = 1,2 N. 
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3. Output G(x) = sign \^a m G m (x)\ 



In the algorithm, the current classifier G m (x) is induced on the weighted observations at line 
5 2a. The resulting weighted error rate is computed at line 2b. Line 2c calculates the weight a m 
given to G,„(x) in producing the final classifier G(x) (line 3). The individual weights of each 
of the observations are updated for the next iteration at line 2d. Observations misclassified 
by G m (x) have their weights scaled by a factor exp(a,„), increasing their relative influence for 
inducing the next classifier G m +i(x) in the sequence. In some embodiments, modifications of 
1 0 the Freund and Schapire, 1 997, Journal of Computer and System Sciences 55, pp. 119-139, 
boosting method are used. See, for example, Hasti et al, The Elements of Statistical 
Learning, 2001, Springer, New York, Chapter 10. In some embodiments, boosting or 
adaptive boosting methods are used. 



15 Computer and System Sciences 55, pp. 1 19-139, are used. For example, in some 

embodiments, feature preselection is performed using a technique such as the nonparametric 
scoring methods of Park et al, 2002, Pac. Symp. Biocomput. 6, 52-63. Feature preselection 
is a form of dimensionality reduction in which the genes that discriminate between 
classifications the best are selected for use in the classifier. Then, the LogitBoost procedure 

20 introduced by Friedman et al. , 2000, Ann Stat 28, 337-407 is used rather than the boosting 
procedure of Freund and Schapire. In some embodiments, the boosting and other 
classification methods of Ben-Dor et al, 2000, Journal of Computational Biology 7, 559-583 
are used in the present invention. In some embodiments, the boosting and other classification 
methods of Freund and Schapire, 1997, Journal of Computer and System Sciences 55, 119- 

25 139, are used. 

In the random subspace method, classifiers are constructed in random subspaces of 
the data feature space. These classifiers are usually combined by simple majority voting in 
the final decision rule. See, for example, Ho, "The Random subspace method for 
constructing decision forests," IEEE Trans Pattern Analysis and Machine Intelligence, 1998; 



In some embodiments, modifications of Freund and Schapire, 1997, Journal of 



30 20(8): 832-844. 
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5.2.4.12. OTHER ALGORITHMS 

The pattern classification and statistical techniques described above are merely 
examples of the types of models that can be used to construct a model for classification. 
Moreover, combinations of the techniques described above can be used. Some combinations, 
5 such as the use of the combination of decision trees and boosting, have been described. 
However, many other combinations are possible. In addition, in other techniques in the art 
such as Projection Pursuit and Weighted Voting can be used to construct a classifier. 

5.3. DETERMINATION OF MARKER GENE EXPRESSION LEVELS 
5.3.1. METHODS 

1 0 The expression levels of the marker genes in a sample may be determined by any 

means known in the art. The expression level may be determined by isolating and 
determining the level (i.e., amount) of nucleic acid transcribed from each marker gene. 
Alternatively, or additionally, the level of specific proteins encoded by a marker gene may be 
determined. 

1 5 The level of expression of specific marker genes can be accomplished by determining 

the amount of mRNA, or polynucleotides derived therefrom, present in a sample. Any 
method for determining RNA levels can be used. For example, RNA is isolated from a 
sample and separated on an agarose gel. The separated RNA is then transferred to a solid 
support, such as a filter. Nucleic acid probes representing one or more markers are then 

20 hybridized to the filter by northern hybridization, and the amount of marker-derived RNA is 
determined. Such determination can be visual, or machine-aided, for example, by use of a 
densitometer. Another method of determining RNA levels is by use of a dot-blot or a slot- 
blot. In this method, RNA, or nucleic acid derived therefrom, from a sample is labeled. The 
RNA or nucleic acid derived therefrom is then hybridized to a filter containing 

25 oligonucleotides derived from one or more marker genes, wherein the oligonucleotides are 
placed upon the filter at discrete, easily-identifiable locations. Hybridization, or lack thereof, 
of the labeled RNA to the filter-bound oligonucleotides is determined visually or by 
densitometer. Polynucleotides can be labeled using a radiolabel or a fluorescent (i.e., visible) 
label. 

30 These examples are not intended to be limiting; other methods of determining RNA 

abundance are known in the art. 
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The level of expression of particular marker genes may also be assessed by 
determining the level of the specific protein expressed from the marker genes. This can be 
accomplished, for example, by separation of proteins from a sample on a polyacrylamide gel, 
followed by identification of specific marker-derived proteins using antibodies in a western 
blot. Alternatively, proteins can be separated by two-dimensional gel electrophoresis 
systems Two-dimensional gel electrophoresis is well-known in the art and typically involves 
isoelectric focusing along a first dimension followed by SDS-PAGE electrophoresis along a 
second dimension. See, e.g., Hames et al, 1990, Gel Electrophoresis of Proteins: A 
Practical APPROACH, IRL Press, New York; Shevchenko et al, Proc. Nat'lAcad. Sci. USA 
93:1440-1445 (1996); Sagliocco et al, Yeast 12:1519-1533 (1996); Lander, Science 
274:536-539 (1996). The resulting electropherograms can be analyzed by numerous 
techniques, including mass spectrometric techniques, western blotting and immunoblot 
analysis using polyclonal and monoclonal antibodies. 

Alternatively, marker-derived protein levels can be determined by constructing an 
antibody microarray in which binding sites comprise immobilized, preferably monoclonal, 
antibodies specific to a plurality of protein species encoded by the cell genome. Preferably, 
antibodies are present for a substantial fraction of the marker-derived proteins of interest. 
Methods for making monoclonal antibodies are well known (see, e.g., Harlow and Lane, 
1988, Antibodies: A Laboratory Manual, Cold Spring Harbor, New York, which is 
) incorporated in its entirety for all purposes). In one embodiment, monoclonal antibodies are 
raised against synthetic peptide fragments designed based on genomic sequence of the cell. 
With such an antibody array, proteins from the cell are contacted to the array, and their 
binding is assayed with assays known in the art. Generally, the expression, and the level of 
expression, of proteins of diagnostic or prognostic interest can be detected through 
15 immunohistochemical staining of tissue slices or sections. 

Finally, expression of marker genes in a number of tissue specimens may be 
characterized using a "tissue array" (Kononen et al, Nat. Med 4(7):844-7 (1998)). In a tissue 
array, multiple tissue samples are assessed on the same microarray. The arrays allow in situ 
detection of RNA and protein levels; consecutive sections allow the analysis of multiple 
30 samples simultaneously. 

5.3.2. MiCRGARRAYS 
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In preferred embodiments, polynucleotide microarrays are used to measure expression 
so that the expression status of each of the markers above is assessed simultaneously. 
Generally, microarrays according to the invention comprise a plurality of markers 
informative for prognosis, or outcome determination, for a particular disease or condition, 
5 and, in particular, for individuals having specific combinations of genotypic or phenotypic 
characteristics of the disease or condition (i.e., that are prognosis-informative for a particular 
patient subset). 

The microarrays of the invention preferably comprise at least 2, 3, 4, 5, 7, 10, 15, 20, 
25, 30, 35, 40, 45, 50, 75, 100, 150, 200 or more of markers, or all of the markers, or any 

10 combination of markers, identified as prognosis-informative within a patient subset. The 

actual number of informative markers the microarray comprises will vary depending upon the 
particular condition of interest, the number of markers identified, and, optionally, the number 
of informative markers found to result in the least Type I error, Type II error, or Type I and 
Type II error in determination of prognosis. As used herein, "Type I error" means a false 

1 5 positive and "Type II error" means a false negative; in the example of prognosis of beast 
cancer, Type I error is the mis-characterization of an individual with a good prognosis as 
having a poor prognosis, and Type II error is the mis-characterization of an individual with a 
poor prognosis as having a good prognosis. 



20 prognosis markers identified for a particular patient subset comprise at least 50%, 60%, 70%, 
80%, 85%, 90%, 95% or 98% of the probes on said array. In another specific embodiment, 
the microarray comprises a plurality of probes, wherein said plurality of probes comprise 
probes complementary and hybridizable to at least 75% of the prognosis-informative markers 
identified for a particular patient subset. Microarrays of the invention, of course, may 

25 comprise probes complementary and hybridizable to prognosis-informative markers for a 

plurality of the patient subsets, or for each patient subset, identified for a particular condition. 
In another embodiment, therefore, the microarray of the invention comprises a plurality of 
probes complementary and hybridizable to at least 75% of the prognosis-informative markers 
identified for each patient subset identified for the condition of interest, and wherein said 

30 probes, in total, are at least 50% of the probes on said microarray. 

In yet another specific embodiment, microarrays that are used in the methods 
disclosed herein optionally comprise markers additional to at least some of the markers 



In specific embodiments, the invention provides polynucleotide arrays in which the 
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identified by the methods disclosed elsewhere herein. For example, in a specific 
embodiment, the microarray is a screening or scanning array as described in Altschuler et ah, 
International Publication WO 02/18646, published March 7, 2002 and Scherer et al, 
International Publication WO 02/16650, published February 28, 2002. The scanning and 
5 screening arrays comprise regularly-spaced, positionally-addressable probes derived from 
genomic nucleic acid sequence, both expressed and unexpressed. Such arrays may comprise 
probes corresponding to a subset of, or all of, the markers identified for the patient subset(s) 
for the condition of interest, and can be used to monitor marker expression in the same way 
as a microarray containing only prognosis-informative markers otherwise identified. 

10 In yet another specific embodiment, the microarray is a commercially-available 

cDNA microarray that comprises at least five markers identified by the methods described 
herein. Preferably, a commercially-available cDNA microarray comprises all of the markers 
identified by the methods described herein as being informative for a patient subset for a 
particular condition. However, such a microarray may comprise at least 5, 10, 15 or 25 of 

1 5 such markers, up to the maximum number of markers identified. 

In an embodiment specific to breast cancer, the invention provides for oligonucleotide 
or cDNA arrays comprising probes hybridizable to the genes corresponding to each of the 
marker sets described above (i.e., markers informative for ER~ sporadic individuals, markers 
informative for ER~ BRCA1 individuals, markers informative for ER+, ER/AGE high 
20 individuals, markers informative for ER+, ER/AGE low, LN+ individuals, and markers 
informative for ER+, ER/AGE low, LN~ individuals, as shown in Tables 1-5). Any of the 
microarrays described herein may be provided in a sealed container in a kit. 

The invention provides microarrays containing probes useful for the prognosis of any 
breast cancer patient, or for breast cancer patients classified into one of a plurality of patient 

25 subsets. In particular, the invention provides polynucleotide arrays comprising probes to a 
subset or subsets of at least 5, 10, 15, 20, 25 or more of the genetic markers, or up to the full 
set of markers, in any of Tables 1-5, which distinguish between patients with good and poor 
prognosis. In certain embodiments, therefore, the invention provides microarrays comprising 
probes for a plurality of the genes for which markers are listed in Tables 1, 2, 3, 4 or 5. In a 

30 specific embodiment, the microarray of the invention comprises 1 , 2, 3, 4, 5 or 10 of the 

markers in Table 1, at least five of the markers in Table 2; 1, 2, 3, 4, 5 or 10 of the markers in 
Table 3; 1, 2, 3, 4, 5 or 10 of the markers in Table 4; or 1, 2, 3, 4, 5 or 10 of the markers in 



-91- 





WO 2006/084272 



PC17US2006/004280 



Table 1. In other embodiments, the microarray comprises probes for 1, 2, 3, 4, 5, or 10 of the 
markers shown in any two, three or four of Tables 1-5, or all of Tables 1-5. In other 
embodiments, the microarray of the invention contains each of the markers in Table 1, Table 
2, Table 3, Table 4, or Table 5. In another embodiment, the microarray contains all of the 
5 markers shown in Tables 1-5. In specific embodiments, the array comprises probes derived 
only from the markers listed in Table 1, Table 2, Table 3, Table 4, or Table 5; probes derived 
from any two of Tables 1-5; any three of Tables 1-5; any four of Tables 1-5; or all of Tables 
1-5. 

In other embodiments, the array comprises a plurality of probes derived from markers 
1 0 listed in any of Tables 1 -5 in combination with a plurality of other probes, derived from 

markers not listed in any of Tables 1-5, that are identified as informative for the prognosis of 
breast cancer. 

In specific embodiments, the invention provides polynucleotide arrays in which the 
breast cancer prognosis markers described herein in Tables 1, 2, 3, 4 and/or 5 comprise at 

1 5 least 50%, 60%, 70%, 80%, 85%, 90%, 95% or 98% of the probes on said array. In another 
specific embodiment, the microarray comprises a plurality of probes, wherein said plurality 
of probes comprise probes complementary and hybridizable to at least 75% of the genes for 
which markers are listed in Table 1; probes complementary and hybridizable to at least 75% 
of the genes for which markers are listed in Table 2; probes complementary and hybridizable 

20 to at least 75% of the genes for which markers are listed in Table 3; probes complementary 
and hybridizable to at least 75% of the genes for which markers are listed in Table 4; and 
probes complementary and hybridizable to at least 75% of the genes for which markers are 
listed in Table 5, wherein said probes, in total, are at least 50% of the probes on said 



disclosed herein optionally comprise markers additional to at least some of the markers listed 
in Tables 1-5. For example, in a specific embodiment, the microarray is a screening or 
scanning array as described in Altschuler et al., International Publication WO 02/18646, 
published March 7, 2002 and Scherer et al, International Publication WO 02/16650, 
30 published February 28, 2002. The scanning and screening arrays comprise regularly-spaced, 
positionally-addressable probes derived from genomic nucleic acid sequence, both expressed 
and unexpressed. Such arrays may comprise probes corresponding to a subset of, or all of, 



microarray. 



25 



In yet another specific embodiment, microarrays that are used in the methods 
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the markers listed in Tables 1-5, or a subset thereof as described above, and can be used to 
monitor marker expression in the same way as a microarray containing only markers listed in 
Tables 1-5. 

In yet another specific embodiment, the microarray is a commercially-available 
5 cDNA microarray that comprises at least five of the markers listed in Tables 1-5. Preferably, 
a commercially-available cDNA microarray comprises all of the markers listed in Tables 1-5. 
However, such a microarray may comprise at least 5, 10, 15 or 25 of the markers in any of 
Tables 1-5, up to the maximum number of markers in a Table, and may comprise all of the 
markers in any one of Tables 1-5, and a subset of another of Tables 1-5, or subsets of each as 
1 0 described above. In a specific embodiment of the microarrays used in the methods disclosed 
herein, the markers that are all or a portion of Tables 1-5 make up at least 50%, 60%, 70%, 
80%, 90%, 95% or 98% of the probes on the microarray. 

General methods pertaining to the construction of microarrays comprising the marker 
sets and/or subsets above are described in the following sections. 



Microarrays are prepared by selecting probes which comprise a polynucleotide 
sequence, and then immobilizing such probes to a solid support or surface. For example, the 
probes may comprise DNA sequences, RNA sequences, or copolymer sequences of DNA and 
RNA. The polynucleotide sequences of the probes may also comprise DNA and/or RNA 
20 analogues, or combinations thereof. For example, the polynucleotide sequences of the probes 
may be full or partial fragments of genomic DNA. The polynucleotide sequences of the 
probes may also be synthesized nucleotide sequences, such as synthetic oligonucleotide 
sequences. The probe sequences can be synthesized either enzymatically in vivo, 
enzymatically in vitro (e.g., by PCR), or non-enzymatically in vitro. 

25 The probe or probes used in the methods of the invention are preferably immobilized 

to a solid support which may be either porous or non-porous. For example, the probes of the 
invention may be polynucleotide sequences which are attached to a nitrocellulose or nylon 
membrane or filter covalently at either the 3' or the 5' end of the polynucleotide. Such 
hybridization probes are well known in the art (see, e.g., Sambrook et al., Molecular 

30 Cloning - A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor Laboratory, 
Cold Spring Harbor, New York (1989). Alternatively, the solid support or surface may be a 



15 
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glass or plastic surface. In a particularly preferred embodiment, hybridization levels are 
measured to microarrays of probes consisting of a solid phase on the surface of which are 
immobilized a population of polynucleotides, such as a population of DNA or DNA mimics, 
or, alternatively, a population of RNA or RNA mimics. The solid phase may be a nonporous 
5 or, optionally, a porous material such as a gel. 

In preferred embodiments, a microarray comprises a support or surface with an 
ordered array of binding (e.g., hybridization) sites or "probes" each representing one of the 
markers described herein. Preferably the microarrays are addressable arrays, and more 
preferably positionally addressable arrays. More specifically, each probe of the array is 
1 0 preferably located at a known, predetermined position on the solid support such that the 
identity (i.e., the sequence) of each probe can be determined from its position in the array 
(i.e., on the support or surface). In preferred embodiments, each probe is covalently attached 
to the solid support at a single site. 

Microarrays can be made in a number of ways, of which several are described below. 

1 5 However produced, microarrays share certain characteristics. The arrays are reproducible, 
allowing multiple copies of a given array to be produced and easily compared with each 
other. Preferably, microarrays are made from materials that are stable under binding (e.g., 
nucleic acid hybridization) conditions. The microarrays are preferably small, e.g., between 1 
cm 2 and 25 cm 2 , between 12 cm 2 and 13 cm 2 , or 3 cm 2 . However, larger arrays are also 

20 contemplated and may be preferable, e.g., for use in screening arrays. Preferably, a given 
binding site or unique set of binding sites in the microarray will specifically bind (e.g., 
hybridize) to the product of a single gene in a cell (e.g., to a specific mRNA, or to a specific 
cDNA derived therefrom). However, in general, other related or similar sequences will cross 
hybridize to a given binding site. 

25 The microarrays of the present invention include one or more test probes, each of 

which has a polynucleotide sequence that is complementary to a subsequence of RNA or 
DNA to be detected. Preferably, the position of each probe on the solid surface is known. 
Indeed, the microarrays are preferably positionally addressable arrays. Specifically, each 
probe of the array is preferably located at a known, predetermined position on the solid 

30 support such that the identity (i.e., the sequence) of each probe can be determined from its 
position on the array (i.e., on the support or surface). 
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According to the invention, the microarray is an array {i.e., a matrix) in which each 
position represents one of the markers described herein. For example, each position can 
contain a DNA or DNA analogue based on genomic DNA to which a particular RNA or 
cDNA transcribed from that genetic marker can specifically hybridize. The DNA or DNA 
5 analogue can be, e.g., a synthetic oligomer or a gene fragment. In one embodiment, probes 
representing each of the markers is present on the array. In a preferred embodiment, the array 
comprises probes for each of the markers listed in Tables 1-5. 



10 specifically hybridizes according to the invention contains a complementary genomic 
polynucleotide sequence. The probes of the microarray preferably consist of nucleotide 
sequences of no more than 1,000 nucleotides. In some embodiments, the probes of the array 
consist of nucleotide sequences of 10 to 1,000 nucleotides. In a preferred embodiment, the 
nucleotide sequences of the probes are in the range of 10-200 nucleotides in length and are 

1 5 genomic sequences of a species of organism, such that a plurality of different probes is 
present, with sequences complementary and thus capable of hybridizing to the genome of 
such a species of organism, sequentially tiled across all or a portion of such genome. In other 
specific embodiments, the probes are in the range of 10-30 nucleotides in length, in the range 
of 10-40 nucleotides in length, in the range of 20-50 nucleotides in length, in the range of 40- 

20 80 nucleotides in length, in the range of 50- 1 50 nucleotides in length, in the range of 80-1 20 
nucleotides in length, and most preferably are 60 nucleotides in length. 

The probes may comprise DNA or DNA "mimics" {e.g., derivatives and analogues) 
corresponding to a portion of an organism's genome. In another embodiment, the probes of 
the microarray are complementary RNA or RNA mimics. DNA mimics are polymers 
25 composed of subunits capable of specific, Watson-Crick-like hybridization with DNA, or of 
specific hybridization with RNA. The nucleic acids can be modified at the base moiety, at 
the sugar moiety, or at the phosphate backbone. Exemplary DNA mimics include, e.g., 
phosphorothioates. 

DNA can be obtained, e.g., by polymerase chain reaction (PCR) amplification of 
30 genomic DNA or cloned sequences. PCR primers are preferably chosen based on a known 
sequence of the genome that will result in amplification of specific fragments of genomic 



5.3.2.2. PREPARING PROBES FOR MICROARRAYS 



As noted above, the "probe" to which a particular polynucleotide molecule 
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DNA. Computer programs that are well known in the art are useful in the design of primers 
with the required specificity and optimal amplification properties, such as Oligo version 5.0 
(National Biosciences). Typically each probe on the microarray will be between 10 bases 
and 50,000 bases, usually between 300 bases and 1,000 bases in length. PCR methods are 
5 well known in the art, and are described, for example, in Innis et al. , eds., PCR Protocols: 
A Guide to Methods and Applications, Academic Press Inc., San Diego, CA (1990). It 
will be apparent to one skilled in the art that controlled robotic systems are useful for 
isolating and amplifying nucleic acids. 

An alternative, preferred means for generating the polynucleotide probes of the 
1 0 microarray is by synthesis of synthetic polynucleotides or oligonucleotides, e.g. , using N- 

phosphonate or phosphoramidite chemistries (Froehler et al, Nucleic Acid Res. 14:5399-5407 
(1986); McBride et al, Tetrahedron Lett. 24:246-248 (1983)). Synthetic sequences are 
typically between about 10 and about 500 bases in length, more typically between about 20 
and about 100 bases, and most preferably between about 40 and about 70 bases in length. In 
1 5 some embodiments, synthetic nucleic acids include non-natural bases, such as, but by no 
means limited to, inosine. As noted above, nucleic acid analogues may be used as binding 
sites for hybridization. An example of a suitable nucleic acid analogue is peptide nucleic acid 
(see, e.g., Egholm etal, Nature 363:566-568 (1993); U.S. Patent No. 5,539,083). 

Probes are preferably selected using an algorithm that takes into account binding 
20 energies, base composition, sequence complexity, cross-hybridization binding energies, and 
secondary structure. See Friend et al., International Patent Publication WO 01/05935, 
published January 25, 2001; Hughes et al, Nat. Biotech. 19:342-7 (2001). 

A skilled artisan will also appreciate that positive control probes, e.g., probes known 
to be complementary and hybridizable to sequences in the target polynucleotide molecules, 

25 and negative control probes, e.g. , probes known to not be complementary and hybridizable to 
sequences in the target polynucleotide molecules, should be included on the array. In one 
embodiment, positive controls are synthesized along the perimeter of the array. In another 
embodiment, positive controls are synthesized in diagonal stripes across the array. In still 
another embodiment, the reverse complement for each probe is synthesized next to the 

30 position of the probe to serve as a negative control. In yet another embodiment, sequences 
from other species of organism are used as negative controls or as "spike-in" controls. 
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5.3.2.3. ATTACHING PROBES TO THE SOLID SURFACE 



The probes are attached to a solid support or surface, which may be made, e.g., from 
glass, plastic (e.g., polypropylene, nylon), polyacrylamide, nitrocellulose, gel, or other porous 
or nonporous material. A preferred method for attaching the nucleic acids to a surface is by 
5 printing on glass plates, as is described generally by Schena et al, Science 270:467-470 

(1995) . This method is especially useful for preparing microarrays of cDNA (See also, 
DeRisi et al, Nature Genetics 14:457-460 (1996); Shalon et al, Genome Res. 6 :639-645 

(1996) ; and Schena et al, Proc. Natl. Acad. Sci. U.S.A. 93:10539-1 1286 (1995)). 

A second preferred method for making microarrays is by making high-density 
10 oligonucleotide arrays. Techniques are known for producing arrays containing thousands of 
oligonucleotides complementary to defined sequences, at defined locations on a surface using 
photolithographic techniques for synthesis in situ (see, Fodor et al., 1991, Science 251:767- 
773; Pease etal, 1994, Proc. Natl. Acad. Sci. U.S.A. 91:5022-5026; Lockhart et al., 1996, 
Nature Biotechnology 14:1675; U.S. Patent Nos. 5,578,832; 5,556,752; and 5,510,270) or 
1 5 other methods for rapid synthesis and deposition of defined oligonucleotides (Blanchard et 
al. , Biosensors & Bioelectronics 1 1 :687-690). When these methods are used, 
oligonucleotides (e.g., 60-mers) of known sequence are synthesized directly on a surface such 
as a derivatized glass slide. Usually, the array produced is redundant, with several 
oligonucleotide molecules per RNA. 

20 Other methods for making microarrays, e.g., by masking (Maskos and Southern, 

1992, Nuc. Acids. Res. 20:1679-1684), may also be used. In principle, and as noted supra, 
any type of array, for example, dot blots on a nylon hybridization membrane (see Sambrook 
et al., Molecular Cloning - A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, New York (1989)) could be used. However, as will 

25 be recognized by those skilled in the art, very small arrays will frequently be preferred 
because hybridization volumes will be smaller. 

In one embodiment, the arrays of the present invention are prepared by synthesizing 
polynucleotide probes on a support. In such an embodiment, polynucleotide probes are 
attached to the support covalently at either the 3' or the 5' end of the polynucleotide. 

30 In a particularly preferred embodiment, microarrays of the invention are 

manufactured by means of an ink jet printing device for oligonucleotide synthesis, e.g., using 
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the methods and systems described by Blanchard in U.S. Pat. No. 6,028,189; Blanchard et 
al., 1996, Biosensors and Bioelectronics 11:687-690; Blanchard, 1998, in Synthetic DNA 
Arrays in Genetic Engineering, Vol. 20, J.K. Setlow, Ed., Plenum Press, New York at pages 
1 1 1-123. Specifically, the oligonucleotide probes in such microarrays are preferably 

5 synthesized in arrays, e.g., on a glass slide, by serially depositing individual nucleotide bases 
in "microdroplets" of a high surface tension solvent such as propylene carbonate. The 
microdroplets have small volumes (e.g., 100 pL or less, more preferably 50 pL or less) and 
are separated from each other on the microarray (e.g., by hydrophobic domains) to form 
circular surface tension wells which define the locations of the array elements (i.e., the 

1 0 different probes). Microarrays manufactured by this ink-jet method are typically of high 
density, preferably having a density of at least about 2,500 different probes per 1 cm 2 . The 
polynucleotide probes are attached to the support covalently at either the 3' or the 5' end of 
the polynucleotide. 

5.3.2.4. TARGET POLYNUCLEOTIDE MOLECULES 

1 5 The polynucleotide molecules which may be analyzed by the present invention (the 

"target polynucleotide molecules") may be from any clinically relevant source, but are 
expressed RNA or a nucleic acid derived therefrom (e.g., cDNA or amplified RNA derived 
from cDNA that incorporates an RNA polymerase promoter), including naturally occurring 
nucleic acid molecules, as well as synthetic nucleic acid molecules. In one embodiment, the 

20 target polynucleotide molecules comprise RNA, including, but by no means limited to, total 
cellular RNA, poly(A) + messenger RNA (mRNA) or fraction thereof, cytoplasmic mRNA, or 
RNA transcribed from cDNA (i.e., cRNA; see, e.g., Linsley & Schelter, U.S. Patent 
Application No. 09/411,074, filed October 4, 1999, or U.S. Patent Nos. 5,545,522, 5,891,636, 
or 5,716,785). Methods for preparing total and poly(A) + RNA are well known in the art, and 

25 are described generally, e.g. , in Sambrook et al. , Molecular Cloning - A Laboratory 
Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, New 
York (1989). In one embodiment, RNA is extracted from cells of the various types of interest 
in this invention using guanidinium thiocyanate lysis followed by CsCl centrifugation 
(Chirgwin et al, 1979, Biochemistry 18:5294-5299). In another embodiment, total RNA is 

30 extracted using a silica gel-based column, commercially available examples of which include 
RNeasy (Qiagen, Valencia, California) and StrataPrep (Stratagene, La Jolla, California). In 
an alternative embodiment, which is preferred for S. cerevisiae, RNA is extracted from cells 
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using phenol and chloroform, as described in Ausubel et al., eds., 1989, Current 
Protocols in Molecular Biology, Vol. Ill, Green Publishing Associates, Inc., John Wiley 
& Sons, Inc., New York, at pp. 13.12.1-13.12.5). Poly(A) + RNA can be selected, e.g., by 
selection with oligo-dT cellulose or, alternatively, by oligo-dT primed reverse transcription of 
5 total cellular RNA. In one embodiment, RNA can be fragmented by methods known in the 
art, e.g., by incubation with ZnCl 2 , to generate fragments of RNA. In another embodiment, 
the polynucleotide molecules analyzed by the invention comprise cDNA, or PCR products of 
amplified RNA or cDNA. 

In one embodiment, total RNA, mRNA, or nucleic acids derived therefrom, is isolated 
1 0 from a sample taken from a person afflicted with breast cancer. Target polynucleotide 

molecules that are poorly expressed in particular cells may be enriched using normalization 
techniques (Bonaldo et al, 1996, Genome Res. 6:791-806). 

As described above, the target polynucleotides are detectably labeled at one or more 
nucleotides. Any method known in the art may be used to detectably label the target 

1 5 polynucleotides. Preferably, this labeling incorporates the label uniformly along the length of 
the RNA, and more preferably, the labeling is carried out at a high degree of efficiency. One 
embodiment for this labeling uses oligo-dT primed reverse transcription to incorporate the 
label; however, conventional methods of this method are biased toward generating 3' end 
fragments. Thus, in a preferred embodiment, random primers (e.g., 9-mers) are used in 

20 reverse transcription to uniformly incorporate labeled nucleotides over the full length of the 
target polynucleotides. Alternatively, random primers may be used in conjunction with PCR 
methods or T7 promoter-based in vitro transcription methods in order to amplify the target 
polynucleotides. 

In a preferred embodiment, the detectable label is a luminescent label. For example, 
25 fluorescent labels, bioluminescent labels, chemiluminescent labels, and colorimetric labels 
may be used in the present invention. In a highly preferred embodiment, the label is a 
fluorescent label, such as a fluorescein, a phosphor, a rhodamine, or a polymethine dye 
derivative. Examples of commercially available fluorescent labels include, for example, 
fluorescent phosphoramidites such as FluorePrime (Amersham Pharmacia, Piscataway, N.J.), 
30 Fluoredite (Millipore, Bedford, Mass.), FAM (ABI, Foster City, Calif.), and Cy3 or Cy5 
(Amersham Pharmacia, Piscataway, N.J.). In another embodiment, the detectable label is a 
radiolabeled nucleotide. 
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In a further preferred embodiment, target polynucleotide molecules from a patient 
sample are labeled differentially from target polynucleotide molecules of a standard. The 
standard can comprise target polynucleotide molecules from normal individuals (i.e., those 
not afflicted with breast cancer). In a highly preferred embodiment, the standard comprises 
target polynucleotide molecules pooled from samples from normal individuals or tumor 
samples from individuals having sporadic-type breast tumors. In another embodiment, the 
target polynucleotide molecules are derived from the same individual, but are taken at 
different time points, and thus indicate the efficacy of a treatment by a change in expression 
of the markers, or lack thereof, during and after the course of treatment (i.e., chemotherapy, 
radiation therapy or cryotherapy), wherein a change in the expression of the markers from a 
poor prognosis pattern to a good prognosis pattern indicates that the treatment is efficacious. 
In this embodiment, different timepoints are differentially labeled. 



Nucleic acid hybridization and wash conditions are chosen so that the target 
polynucleotide molecules specifically bind or specifically hybridize to the complementary 
polynucleotide sequences of the array, preferably to a specific array site, wherein its 
complementary DNA is located. 

Arrays containing double-stranded probe DNA situated thereon are preferably 
subjected to denaturing conditions to render the DNA single-stranded prior to contacting with 
the target polynucleotide molecules. Arrays containing single-stranded probe DNA (e.g., 
synthetic oligodeoxyribonucleic acids) may need to be denatured prior to contacting with the 
target polynucleotide molecules, e.g., to remove hairpins or dimers which form due to self 
complementary sequences. 

Optimal hybridization conditions will depend on the length (e.g., oligomer versus 
polynucleotide greater than 200 bases) and type (e.g., RNA, or DNA) of probe and target 
nucleic acids. One of skill in the art will appreciate that as the oligonucleotides become 
shorter, it may become necessary to adjust their length to achieve a relatively uniform 
melting temperature for satisfactory hybridization results. General parameters for specific 
(i.e., stringent) hybridization conditions for nucleic acids are described in Sambrook et al, 
Molecular Cloning - A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York (1989), and in Ausubel et al., CURRENT 



5.3.2.5. HYBRIDIZATION TO MICROARRAYS 
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PROTOCOLS in Molecular Biology, vol. 2, Current Protocols Publishing, New York 
(1994). Typical hybridization conditions for the cDNA microarrays of Schena et al. are 
hybridization in 5 X SSC plus 0.2% SDS at 65°C for four hours, followed by washes at 25°C 
in low stringency wash buffer (1 X SSC plus 0.2% SDS), followed by 10 minutes at 25°C in 
5 higher stringency wash buffer (0. 1 X SSC plus 0.2% SDS) (Schena et al. , Proc. Natl. Acad. 
Sci. U.S.A. 93.10614 (1993)). Useful hybridization conditions are also provided in, e.g., 
Tijessen, 1993, Hybridization With Nucleic Acid Probes, Elsevier Science Publishers 
B.V.; and Kricka, 1992, Nonisotopic DNA Probe Techniques , Academic Press, San 
Diego, CA. 

1 0 Particularly preferred hybridization conditions include hybridization at a temperature 

at or near the mean melting temperature of the probes (e.g., within 5FC, more preferably 
within 21°C) in 1 M NaCl, 50 mM MES buffer (pH 6.5), 0.5% sodium sarcosine and 30% 
formamide. 

5.3.2.6. SIGNAL DETECTION AND DATA ANALYSIS 

1 5 When fluorescently labeled probes are used, the fluorescence emissions at each site of 

a microarray may be, preferably, detected by scanning confocal laser microscopy. In one 
embodiment, a separate scan, using the appropriate excitation line, is carried out for each of 
the two fluorophores used. Alternatively, a laser may be used that allows simultaneous 
specimen illumination at wavelengths specific to the two fluorophores and emissions from 

20 the two fluorophores can be analyzed simultaneously (see Shalon et al., 1996, "A DNA 
microarray system for analyzing complex DNA samples using two-color fluorescent probe 
hybridization," Genome Research 6:639-645, which is incorporated by reference in its 
entirety for all purposes). In a preferred embodiment, the arrays are scanned with a laser 
fluorescent scanner with a computer controlled X-Y stage and a microscope objective. 

25 Sequential excitation of the two fluorophores is achieved with a multi-line, mixed gas laser 
and the emitted light is split by wavelength and detected with two photomultiplier tubes. 
Fluorescence laser scanning devices are described in Schena et al., Genome Res. 6:639-645 
(1996), and in other references cited herein. Alternatively, the fiber-optic bundle described 
by Ferguson et al, Nature Biotech. 14:1681-1684 (1996), may be used to monitor mRNA 

30 abundance levels at a large number of sites simultaneously. 
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Signals are recorded and, in a preferred embodiment, analyzed by computer, e.g., 
using a 12 or 16 bit analog to digital board. In one embodiment the scanned image is 
despeckled using a graphics program (e.g., Hijaak Graphics Suite) and then analyzed using an 
image gridding program that creates a spreadsheet of the average hybridization at each 
5 wavelength at each site. If necessary, an experimentally determined correction for "cross 
talk" (or overlap) between the channels for the two fluors may be made. For any particular 
hybridization site on the transcript array, a ratio of the emission of the two fluorophores can 
be calculated. The ratio is independent of the absolute expression level of the cognate gene, 
but is useful for genes whose expression is significantly modulated in association with the 
10 different breast cancer-related condition. 

5.4. THERAPEUTIC REGIMENS SPECIFIC TO PATIENT SUBSETS 
The benefit of identifying subsets of individuals that have a common condition, 
followed by identification of sets of genes informative for those particular subsets of 
individuals, is that such subdivision and identification tends to more accurately identify the 

1 5 subset of genes responsible for, or most closely associated with, a particular form of the 
condition. For example, breast cancer is a complex condition brought about by several 
different molecular mechanisms. ER+ individuals, particularly ER+, ER/AGE high 
individuals, show an increased level of expression of cell cycle-control genes, and the 
expression of these genes is highly informative for prognosis in this patient subset (see 

20 Examples). In ER~ individuals, however, the expression of these genes is not informative for 
prognosis. 

The set of informative markers, therefore, can be used to assign a particular course of 
therapy to an individual, e.g., an individual having breast cancer, depending upon the 
condition subset into which the individual is classified. In one embodiment, therefore, the 

25 invention provides a method of assigning a course of therapy to an individual having a 

condition, said method comprising classifying the individual into one of a plurality of subsets 
of a condition, wherein a plurality of informative genes has been identified for at least one of 
said subsets; and assigning a course of therapy known or suspected to be effective for treating 
the subset of the condition associated with those genes. In a specific embodiment, said 

30 condition is breast cancer, said patient subset is ER+, ER/AGE high status, and said course of 
therapy comprises the administration of one or more compounds known or suspected to be 
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effective at arresting the cell cycle. In a more specific embodiment, said one or more 
compounds comprises taxol or a vinca alkaloid. 

Of course, any course of therapy selected or assigned on the basis of the above 
phenotypes and gene expression may be supplemented by other treatments or courses of 
5 therapy relevant to or known or suspected to be effective in the treatment of the condition. 
For example, the treatment of breast cancer may additionally comprise surgery, either tissue- 
preserving or radical, radiation treatment, chemotherapy other than that suggested by gene 
expression analysis, or any other therapy or treatment known or suspected to be effective. 

5.5. CLINICAL TRIALS AND EPIDEMIOLOGICAL STUDIES 
10 The method of the present invention may also be used to assign individuals to 

categories within a clinical trial, epidemiological study or the like. For example, individuals 
may be distinguished according to a characteristic of a condition, such as the presence or 
absence of specific proteins (e.g., estrogen receptor) or tissue structures (e.g., lymph nodes), 
and with prognosis, and the results of the trial correlated with prognosis. In a specific 
1 5 example, the condition is breast cancer, the characteristic is the presence of the estrogen 
receptor, and the prognosis is the expected occurrence or non-occurrence of metastases 
within a given period, for example, five years, after initial diagnosis. In another specific 
example, the condition is obesity, the characteristics are 24-hour energy expenditure, and the 
prognosis is the expected occurrence of heart disease or diabetes. In another specific 
20 example, the condition is a neurodegenerative disease, the characteristic is exposure to a 
particular range of concentration of an environmental toxin, and the prognosis is expected 
occurrence or degree of loss of motor function. In each case, the characteristics and expected 
outcome are used to assign the individual to a category within a clinical trial or 
epidemiological study. 

25 Thus, the invention provides a method for assigning an individual to one of a plurality 

of categories in a clinical trial, comprising classifying the individual into one of a plurality of 
condition categories differentiated by at least one genotypic or phenotypic characteristic of 
the condition; determining the level of expression, in a sample derived from said individual, 
of a plurality of genes informative for said condition category; determining whether said level 

30 of expression of said plurality of genes indicates that the individual has a good prognosis or a 
poor prognosis; and assigning the individual to a category in a clinical trial on the basis of 
prognosis. 
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In a specific embodiment, the invention provides a method of assigning an individual 
to a category in a breast cancer clinical trial, said method comprising: (a) classifying said 
individual as ER", BRCA1, ER~, sporadic; ER+, ER/AGE high; ER+, ER/AGE low, LN+; or 
ER+, ER/AGE low, LN"; (b) determining for said individual the level of expression of at 

5 least two genes for which markers are listed in Table 1 if said individual is classified as ER", 
BRCA1; Table 2 if said individual is classified as ER" sporadic; Table 3 if said individual is 
classified as ER+, ER/AGE high; Table 4 if said individual is classified as ER+, ER/AGE 
low, LN+; or Table 5 if said individual is classified as ER+, ER/AGE low, LN"; (c) 
determining whether said individual has a pattern of expression of said at least two genes that 

1 0 correlates with a good prognosis or a poor prognosis; and (d) assigning said individual to at 
least one category in a clinical trial if said individual has a good prognosis, and assigning said 
individual to a second category in said clinical trial if said individual has a poor prognosis. In 
a more specific embodiment, said individual is additionally assigned to a category in said 
clinical trial on the basis of the classification of said individual as determined in step (a). In 

15 another more specific embodiment, said individual is additionally assigned to a category in 
said clinical trial on the basis of any other clinical, phenotypic or genotypic characteristic of 
breast cancer. In another more specific embodiment, the method additionally comprises 
determining in said cell sample the level of expression, relative to a control, of a second 
plurality of genes for which markers are not found in Tables 1-5, wherein said second 

20 plurality of genes is informative for prognosis of breast cancer, and determining from the 
expression of said second plurality of genes, in addition to said first plurality of genes, 
whether said individual has a good prognosis or a poor prognosis. 

5.6. KITS 

The present invention further provides for kits comprising the marker sets above. The 
25 components of the kits of the present invention are preferably contained in sealed containers. 
In a preferred embodiment, the kit comprises a microarray ready for hybridization to target 
polynucleotide molecules. In specific embodiments, the kit may comprise any of the 
microarrays described in detail in Section 5.3.2. Where proteins are the target molecules, the 
kit preferably comprises a plurality of antibodies for binding to specific condition-related 
30 proteins, and means for identifying such binding (e.g., means for performing a sandwich 

assay, ELISA, RIA, etc.). Such antibodies may be provided, for example, individually or as 
part of an antibody array. The kit may additionally comprise software for the data analyses 
described above, as described in detail in Section 5.7. The kit preferably contains one or 
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more controls. Such a control may be an artificial population of marker-related or marker- 
derived polynucleotides suitable for hybridization to a microarray, wherein the markers are 
related to or relevant to the condition of interest (for example, breast cancer). The control 
may also, or alternatively, be a set of expression values stored on a computer disk or other 
5 storage medium. 

The kits of the invention may be primarily diagnostic in nature; that is, they may 
assist a physician or researcher in deterniining a characteristic, for example, the prognosis, of 
a condition of interest, the likely response to a therapeutic regimen, the likely outcome of 
exposure to an environmental condition, such as toxin exposure, etc. The kits of the 
10 invention may also be used to classify individuals, for example, to place individuals into 
different groups in a clinical trial. The use of each kit is determined by the markers, 
microarrays, controls, etc. included. 

5.7. COMPUTER-FACILITATED ANALYSIS 
The analytic methods described in the previous sections can be implemented by use of 
1 5 the following computer systems and according to the following programs and methods. A 
computer system comprises internal components linked to external components. The internal 
components of a typical computer system include a processor element interconnected with a 
main memory. For example, the computer system can be based on an Intel 8086-, 803 86-, 
80486-, Pentium™, or Pentium™-based processor with preferably 32 MB or more of main 
20 memory. The computer system may also be a Macintosh or a Macintosh-based system, but 
may also be a minicomputer or mainframe. 

The external components preferably include mass storage. This mass storage can be 
one or more hard disks (which are typically packaged together with the processor and 
memory). Such hard disks are preferably of 1 GB or greater storage capacity. Other external 
25 components include a user interface device, which can be a monitor, together with an 

inputting device, which can be a "mouse", or other graphic input devices, and/or a keyboard. 
A printing device can also be attached to the computer. 

Typically, a computer system is also linked to network link, which can be part of an 
Ethernet link to other local computer systems, remote computer systems, or wide area 
30 communication networks, such as the Internet. This network link allows the computer 
system to share data and processing tasks with other computer systems. 
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Loaded into memory during operation of this system are several software 
components, which are both standard in the art and special to the instant invention. These 
software components collectively cause the computer system to function according to the 
methods of this invention. These software components are typically stored on the mass 
storage device. A software component comprises the operating system, which is responsible 
for managing computer system and its network interconnections. This operating system can 
be, for example, of the Microsoft Windows® family, such as Windows 3.1, Windows 95, 
Windows 98, Windows 2000, or Windows NT, or may be of the Macintosh OS family, or 
may be UNIX, a UNIX derivative such as LINUX, or an operating system specific to a 
minicomputer or mainframe. The software component represents common languages and 
functions conveniently present on this system to assist programs implementing the methods 
specific to this invention. Many high or low level computer languages can be used to 
program the analytic methods of this invention. Instructions can be interpreted during run- 
time or compiled. Preferred languages include C/C++, FORTRAN and JAVA. Most 
preferably, the methods of this invention are programmed in mathematical software packages 
that allow symbolic entry of equations and high-level specification of processing, including 
some or all of the algorithms to be used, thereby freeing a user of the need to procedurally 
program individual equations or algorithms. Such packages include Mathlab from 
Mathworks (Natick, MA), Mathematica® from Wolfram Research (Champaign, IL), or S- 
Plus® from Math Soft (Cambridge, MA). Specifically, the software component includes the 
analytic methods of the invention as programmed in a procedural language or symbolic 
package. 

The software to be included with the kit comprises the data analysis methods of the 
invention as disclosed herein. In particular, the software may include mathematical routines 
for marker discovery, mcluding the calculation of similarity values between clinical 
categories (e.g., prognosis) and marker expression. The software may also include 
mathematical routines for calculating the similarity between sample marker expression and 
template marker expression, using array-generated fluorescence data, to determine the 
clinical classification of a sample. 

Additionally, the software may also include mathematical routines for determining the 
prognostic outcome, and the recommended therapeutic regimen, for an individual with a 
condition of interest. In the specific example of breast cancer, the mathematical routines 
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would determine the prognostic outcome and recommended therapeutic regimen for an 
individual having breast cancer. Such breast cancer-specific software would include 
instructions for the computer system's processor to receive data structures that include the 
level of expression of five or more of the marker genes listed in any of Tables 1-5 in a breast 
5 cancer tumor sample obtained from the breast cancer patient; the mean level of expression of 
the same genes in a control or template; and the breast cancer patient's clinical information, 
including age, lymph node status and ER status. The software may additionally include 
mathematical routines for transforming the hybridization data and for calculating the 
similarity between the expression profile of the marker genes in the patient's breast cancer 
10 tumor sample and a template. In a specific embodiment, the software includes mathematical 
routines for calculating a similarity metric, such as a coefficient of correlation, representing 
the similarity between the expression profile for the marker genes in the patient's breast 
cancer tumor sample and the template, and expressing the similarity as that similarity metric. 

The software preferably would include decisional routines that integrate the patient's 
15 clinical and marker gene expression data, and recommend a course of therapy. In one 

embodiment, for example, the software causes the processor unit to receive expression data 
for prognosis-related genes in the patient's tumor sample, calculate a metric of similarity of 
these expression values to the values for the same genes in a template, compare this similarity 
metric to a pre-selected similarity metric threshold or thresholds that differentiate prognostic 
20 groups, assign the patient to a prognostic group, and, on the basis of the prognostic group, 
assign a recommended therapeutic regimen. In a specific example, the software additionally 
causes the processor unit to receive data structures comprising clinical information about the 
breast cancer patient. In a more specific example, such clinical information includes the 
patient's age, estrogen receptor status, and lymph node status. 

25 The software preferably causes the processor unit to receive data structures 

comprising relevant phenotypic and/or genotypic characteristics of the particular condition of 
interest, and/or of an individual having that condition, and classifies the individual into a 
condition subset according to those characteristics. The software then causes the processor to 
receive values for subset-specific markers, to calculate a metric of similarity of the values 

30 associated with those markers (e.g., level, abundance, activity, etc.) from the individual to a 
template, compare this similarity metric to a pre-selected similarity metric threshold or 
thresholds that differentiate prognostic groups, assign the patient to a prognostic group, and, 
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on the basis of the prognostic group, assign a recommended therapeutic regimen. In the 
specific example of breast cancer and a breast cancer patient, the software, in one 
embodiment, causes the processor unit to receive data structures comprising the patient's age, 
estrogen receptor status, and lymph node status, and on the basis of this data, to classify the 
5 patient into one of the following patient subsets: ERT, sporadic; ER _ , BRCA1; ER+, 

AR/AGE high; ER+, ER/AGE low, LN+; or ER+, ER/AGE low, LN~. The software then 
causes the processor to receive expression values for subset-specific prognosis-informative 
gene expression in the patient's tumor sample, calculate a metric of similarity of these 
expression values to the values for the same genes in a patient subset-specific template or 
10 template, compare this similarity metric to a pre-selected similarity metric threshold or 
thresholds that differentiate prognostic groups, assign the patient to the prognostic group, 
and, on the basis of the prognostic group, assign a recommended therapeutic regimen. 

Where the template is an expression template comprising expression values for 
marker genes within a group of patients, e.g., breast cancer patients, the template can 

1 5 comprise either hybridization data obtained at the same time (i. e., in the same hybridization 
experiment) as the patient's individual hybridization data, or can be a set of hybridization or 
marker expression values stores on a computer, or on computer-readable media. If the latter 
is used, new patient hybridization data for the selected marker genes, obtained from initial or 
follow-up tumor samples, or suspected tumor samples, can be compared to the stored values 

20 for the same genes. However, the software may additionally comprise routines for updating 
the template data set, e.g., to add information from additional breast cancer patients or to 
remove existing members of the template data set, and, consequently, for recalculating the 
average expression level values that comprise the template. In another specific embodiment, 
said template comprises a set of single-channel mean hybridization intensity values for each 

25 of said at least five of said genes, stored on a computer-readable medium. 

Clinical data relating to a breast cancer patient, or a patient having another type of 
condition, and used by the computer program products of the invention, can be contained in a 
database of clinical data in which information on each patient is maintained in a separate 
record, which record may contain any information relevant to the patient, the patient's 
30 medical history, treatment, prognosis, or participation in a clinical trial or study, including 
expression profile data generated as part of an initial diagnosis or for tracking the progress of 
the condition, for example, breast cancer, during treatment. 
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Thus, one embodiment of the invention provides a computer program product for 
classifying a breast cancer patient according to prognosis, the computer program product for 
use in conjunction with a computer having a memory and a processor, the computer program 
product comprising a computer readable storage medium having a computer program 
5 mechanism encoded thereon, wherein said computer program product can be loaded into the 
one or more memory units of a computer and causes the one or more processor units of the 
computer to execute the steps of (a) receiving a first data structure comprising said breast 
cancer patient's age, ER status, LN status and tumor type; (b) classifying said patient as ER~ 
sporadic; ER", BRCA1; ER+, ER/AGE high; ER+, ER/AGE low, LN+; or ER+, ER/AGE 

10 low, LIST; (c) receiving a second data structure comprising the levels of expression of at least 
two genes in a cell sample taken from said breast cancer patient wherein markers for said at 
least two genes are listed in Table 1 if said patient is classified as ER - , sporadic; Table 2 if 
said patient is classified as ER - , sporadic; Table 3 if said patient is classified as ER+, 
ER/AGE high; Table 4 if said patient is classified as ER+, ER/AGE low, LN+; or Table 5 if 

15 said patient is classified as ER+, ER/AGE high, LN - ; (d) determining the similarity of the 
expression profile of said at least two genes to a template expression profile of said at least 
two genes to obtain a patient similarity value; (e) comparing said patient similarity value to 
selected first and second threshold values of similarity, respectively, wherein said second 
similarity threshold indicates greater similarity to said template expression profile than does 

20 said first similarity threshold; and (f) classifying said breast cancer patient as having a first 
prognosis if said patient similarity value exceeds said said second threshold similarity values, 
a second prognosis if said patient similarity value exceeds said first threshold similarity value 
but does not exceed said second threshold similarity value, and a third prognosis if said 
patient similarity value does not exceed said first threshold similarity value. In a specific 

25 embodiment of said computer program product, said first threshold value of similarity and 
said second threshold value of similarity are values stored in said computer. In another more 
specific embodiment, said first prognosis is a "very good prognosis," said second prognosis is 
an "intermediate prognosis," and said third prognosis is a "poor prognosis," and wherein said 
computer program mechanism may be loaded into the memory and further cause said one or 

30 more processor units of said computer to execute the step of assigning said breast cancer 
patient a therapeutic regimen comprising no adjuvant chemotherapy if the patient is lymph 
node negative and is classified as having a good prognosis or an intermediate prognosis, or 
comprising chemotherapy if said patient has any other combination of lymph node status and 
expression profile. In another specific embodiment, said computer program mechanism may 
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be loaded into the memory and further cause said one or more processor units of the 
computer to execute the steps of receiving a data structure comprising clinical data specific to 
said breast cancer patient. In a more specific embodiment, said single-channel hybridization 
intensity values are log transformed. The computer implementation of the method, however, 
5 may use any desired transformation method. In another specific embodiment, the computer 
program product causes said processing unit to perform said comparing step (e) by 
calculating the correlation between the expression profile of said genes in said cell sample 
taken from said breast cancer patient and the template expression profile of the same genes. 
In another specific embodiment, the computer program product causes said processing unit to 
1 0 perform said comparing step (e) by calculating the distance between the expression profile of 
said genes in said cell sample taken from said breast cancer patient and the template 
expression profile of the same genes. 

Of course, the above breast cancer-specific examples are not limiting; analogous 
computer systems, software, and data analysis methods may be utilized for any condition of 
1 5 interest. For example, analogous software may be used to determine the prognosis of any 
other type of cancer, or of any other non-cancer diseases or conditions, using markers, 
expression level data and controls specific for that cancer, non-cancer disease or condition. 

In an exemplary implementation, to practice the methods of the present invention, a 
user first loads experimental data into the computer system. These data can be directly 
20 entered by the user from a monitor, keyboard, or from other computer systems linked by a 
network connection, or on removable storage media such as a CD-ROM, floppy disk (not 
illustrated), tape drive (not illustrated), ZIP® drive (not illustrated) or through the network. 
Next the user causes execution of expression profile analysis software which performs the 
methods of the present invention. 

25 In another exemplary implementation, a user first loads experimental data and/or 

databases into the computer system. This data is loaded into the memory from the storage 
media or from a remote computer, preferably from a dynamic geneset database system, 
through the network. Next the user causes execution of software that performs the steps of 
the present invention. 

30 Additionally, because the data obtained and analyzed in the software and computer 

system products of the invention may be confidential, the software and/or computer system 
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preferably comprises access controls or access control routines, such as password protection 
and preferably, particularly if information is to be transmitted between computers, for 
example, over the Internet, encryption of the data by a suitable encryption algorithm (e.g., 
PGP). 

5 Alternative computer systems and software for implementing the analytic methods of 

this invention will be apparent to one of skill in the art and are intended to be comprehended 
within the accompanying claims. In particular, the accompanying claims are intended to 
include the alternative program structures for implementing the methods of this invention that 
will be readily apparent to one of skill in the art. 

10 6. EXAMPLES 



Materials and Methods 
Tumor Samples : 

15 311 cohort samples were collected from breast cancer patients. Selection criteria for 

sporadic patients (i.e., those not identified as having a BRCAl-type tumor; n = 291) included: 
primary invasive breast carcinoma less than 5 cm (Tl or T2); no axillary metastases (NO); 
age at diagnosis of less than 55 years; calendar year of diagnosis 1983-1996; and no previous 
malignancies. All patients were treated by modified radical mastectomy or breast-conserving 

20 treatment. See van't Veer et ah, Nature 415:530 (2002). Selection criteria for hereditary 
(i.e., BRCAl-type; n = 20) tumors included: carriers of germline mutation in BRCA1 or 
BRCA2, and primary invasive breast carcinoma, van't Veer, supra. Additionally, for 
development of a classifier for the BRCA1 group, 14 BRCA1 samples previously identified 
(see van't Veer, supra) were added to the 20 BRCA1 type samples to increase sample size. 

25 Those 14 samples also satisfy the conditions that they are ER negative and age less than 55 
years old. 

Data analysis : 

Sample sub-grouping: As shown in FIG. 1, tumor samples were first divided into ER+ 
and ER~ branches since this is the dominant gene expression pattern. In the ER~ branch, the 
30 samples were further divided into "BRCA1 mutation like" and "Sporadic like" categories 
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using the expression templates and 100 genes previously identified as optimal for 
determining BRCA1 status. See van't Veer et al, Nature 415:530 (2002). In the ER+ 
category, samples were divided by ER vs. age distribution (see below) into two groups, 
"ER/AGE low" and "ER/AGE high." Within the "ER/AGE low" group, samples were 
5 further divided according to the lymph node status into two sub-groups: lymph node negative 
(0 lymph nodes; LN-) and positive (> 0 lymph nodes; LN+) group. 

The result of these divisions was five distinctive sub-groups: "ER~, sporadic" (n = 52), "ER - , 
BRCA1" {n = 34), "ER+, ER/AGE high" (n = 83), "ER+, ER/AGE low, LN~" (n = 81), and 
"ER+, ER/AGE low, LN+" (n = 75). A few samples with a specific ER vs. age distribution 
1 0 in "ER+, ER/AGE low, LN+" group were further excluded to develop a classifier, see below 
for details. 

Estrogen re ceptor level : Estrogen receptor gene expression level was measured by a 60mer 
oligo-nucleotide on a microarray. Since every individual sample was compared to a pool of 
all samples, the ratio to pool was used to measure the relative level. A threshold of -0.65 
1 5 on logio(ratio) was used to separate the ER+ group from ER~ group. See van't Veer et al, 
Nature 415:530 (2002). 

Grouping by ER vs. age distribution : Samples were not uniformly distributed in ER vs. age 
space among the ER+ samples (FIG. 2). First, it appeared that the ER level increases with 
age, as there were few samples from young individuals having a high ER expression level. 

20 For example, in the 35 to 40 years age group, samples having a log(ratio) of ER > 0.2 are 

relatively few as compared to the 40 to 45 age group. In the set of samples used, the 40 < age 
< 45 group contains 30 samples having log(ratio) ER values between -0.2 to 0.2, and 28 
samples having values greater than 0.2, whereas the 35 < age < 40 group includes 24 samples 
with values between -0.2 to 0.2, but only 6 samples with values of greater than 0.2 (Fisher's 

25 exact test P-value: 1%). The increasing ER level with age may simply due to the fact that 
estrogen levels decrease with age, and the estrogen receptor level rises in compensation. 

There also appear to be at least two groups of patients, as indicated by the solid line 
separating the two in FIG. 2 A. A bimodality test of the separation indicated by the solid line 
yielded P-value < 10" 4 . Each of these two groups has its own trend between the ER level and 
3 0 age. The solid line can be approximated by ER = 0. 1 (age - 42.5). Patients having values 
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above the solid line are referred to as the "ER/AGE high" group, and the patients below the 
line as the "ER/AGE low" group. 

Prognosis in each group: 

Feature selection and performance evaluation : For the prognosis in each group, non- 
5 informative genes were filtered in each group of patients. Specifically, only genes with 
|logio(ratio)| > logi 0 (2) and P-value (for log(ratio) # 0) < 0.01 in more than 3 experiments 
were kept. This step removed all genes that never had any significant change across all 
samples. The second step used a leave-one-out cross validation (LOOCV) procedure to 
optimize the number of reporter genes (features) in the classifier and to estimate the 
10 performance of the classifier in each group. The feature selection was included inside the 
loop of each LOOCV process. The final "optimal" reporter genes were selected using all of 
the "training samples" as the result of "re-substitution" because one classifier was needed for 
each group. 

Selection of training samples : Only the samples from patients who had metastases within 5 

1 5 years of initial diagnosis (3 years for "ER~ sporadic" samples; i.e., the "poor outcome" 

group), or who were metastases-free with more than 5 years of follow-up time {i.e., the "good 
outcome" group, were used as the training set. Because the average expression levels for 
informative genes among patients who were metastasis-free, or who had early metastases, 
were used as expression templates for prediction, the training samples for the ER+ samples 

20 were further limited to those samples that could also be correctly classified by the first round 
of LOOCV process. For the "ER~ sporadic" samples, no such iteration was done because 
no improvement was observed. For the "ER~, BRCA1" samples, an iteration was done, but 
the training samples in the second iteration were limited to the correctly predicted good 
outcome samples from the first round of LOOCV, and all the poor outcome samples with 

25 metastases time less than 5 years. Further limitation of the poor outcome samples was not 
performed because of the small number of poor samples and the absence of improvement by 
such limitation. In the first round of LOOCV, except for the "ER~ sporadic" group, the 
number of features was fixed at 50 genes. A patient was predicted to have a favorable 
outcome, that is, no metastases within five years of initial diagnosis, if the expression of the 

30 reporter genes in a sample from the individual was more similar to the "average good profile" 
than the "average poor profile", and a poor outcome, that is, a metastasis within five years, if 
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the expression of the reporter genes in the sample was more similar to the "average poor 
profile" than the "average good profile". 

The justification for such an iteration operation is threefold. First, biologically, there 
are always a few individuals with specific reasons (different from the vast majority) to stay 

5 metastases free or to develop metastases. Second, statistically, most groups of patients 
include outliers that don't follow the distribution of the majority of samples. Third, 
methodologically, the iteration operation is very similar to the idea of "boosting", but instead 
of increasing the weights of the samples predicted wrong, emphasis is placed on the well 
behaved samples for selecting features and training the classifier. Since this process was 

1 0 used to select "training samples", and the performance was evaluated using the LOOCV 
(including the feature selection) after the training sample being fixed, there is no issue of 
over-fitting involved in our procedures. This method of iteration is thus more likely to reveal 
the dominant mode to metastases within each group. 

Error rate and odds ratio, threshold in the final LOOCV : Unless otherwise stated, the error 
15 rate is the average error rate from two populations: (1) the number of poor outcome samples 
misclassified as good outcome samples, divided by the total number of poor outcome 
samples; and (2) the total number of good outcome samples misclassified as poor outcome 
samples, divided by the total number of good samples. Two odds ratios are reported for a 
given threshold: (1) the overall odds ratio and (2) the 5 year odds ratio. The 5 year odds 
20 ratio was calculated from samples from individuals that were metastases free for more than 
five years, and who experienced metastasis within 5 years. The threshold was applied to 
corl - cor2, where "corl" stands for the correlation to the "average good profile" in the 
training set, and "cor2" stands for the correlation to the "average poor profile" in the training 
set. 

25 The threshold in the final round of LOOCV was defined using the following steps: 

(1) For each of the N sample i left out for training, features based on the training set were 
selected. (2) Given a feature set, an incomplete LOOCV with N-l samples was performed 
(only the "average poor profile" and "average good profile" is varied depending on whether 
the left out sample is in the training set or not). (3) The threshold based on the minimum 

30 error rate from N-l samples was determined, and that threshold was assigned to sample i in 
step (1). (4) The median threshold from all N samples was taken, and designated the final 
threshold. FIGS. 3-7 present detailed information about classifiers for the 5 groups: "ER~, 
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sporadic", "ER~, BRCA1", "ER+, ER/age high", "ER+, ER/age low, LN~", "ER+, ER/age 
low, LN+". Tables 1-5 (see Section 5.3) list the final optimal reporter genes for each of the 5 
classifiers for each of the five patient subsets. Table 6, below, summarizes the performance 
of each of the five classifiers together with thresholds used in each classifier. 

5 Table 6. Performance of classifiers for each patient subset. 



Classifier 


Optimal 
of Genes 


(C1-C2) 
Threshold 


Metastasis 
Free 


#of 
Samples 


TP 


FP 


FN 


TN 


Odds 
Ratio 


95% C.I. 


ER+, ER/AGE 
high 


50 


1.22 


Overall 


83 


31 


14 


5 


33 


14.61 


4.71- 
45.36 






5 year 


71 


24 


11 


3 


33 


24.00 


6.03- 
95.46 


ER+, ER/AGE 
low, LN- 


65 


0.38 


Overall 


81 


14 


6 


6 


55 


21.39 


5.98- 
76.52 






5 year 


73 


11 


4 


5 


53 


29.15 


6.73- 
126.33 


ER+, ER/AGE 

low, ush- 


50 


-0.12 


Overall 


56 


7 


4 


6 


39 


11.38 


2.54- 
50.94 






5 year 


48 


5 


4 


3 


36 


15.00 


2.57- 
87.64 


er-, sporadic 


20 


-0.01 


Overall 


52 


18 


7 


7 


29 


7.35 


2.16- 
25.04 








5 year 


45 


16 


5 


6 


18 


9.60 


2.45- 
37.58 


ER-, BRCA1 


10 


-0.37 


Overall 


34 


6 


3 


3 


22 


14.67 


2.34- 
92.11 








5 year 


22 


6 


1 


3 


12 


24.00 


2.04- 
282.68 



TP: True positive 
FP: False positive 
FN: False negative 
TN: True negative 



10 Classification method: All classifiers described herein, feature selection and 

optimization were included inside the LOOCV loop. Classifier performance was based on 
the LOOCV results. The profile based on the selected features from each patient was 
compared to the "average good profile" and "average poor profile" (by correlation) to 
determine its predicted outcome. 

1 5 Correlation calculation: The correlation between each gene's expression log(ratio) 

and the endpoint data (final outcome) was calculated using the Pearson's correlation 
coefficient. The correlation between each patient's profile and the "average good profile" 
and "average poor profile" is the cosine product (no mean subtraction). 

Results: 

-115- 



WO 2006/084272 



PC17US2006/004280 



We employed the comprehensive prognosis strategy on microarray expression 
profiles of 31 1 patients diagnosed before age 55 that were all part of previous studies 
establishing and validating a 70-gene prognosis profile. See van 't Veer et al., Nature 
415:530 (2002); van de Vijver et al.,N. Engl. J. Med 347:1999 (2002). In addition, 14 
5 known BRCA1 samples from the Nature study were included in defining the prognosis 
classifier for the BRCA1 group. The overview of the stratifications is shown in FIG. 1. In 
each of the patient subsets, prognosis classifiers were developed and performance was 
evaluated by leave-one-out cross-validation. The biological make up of each of the 
classifiers was also examined. 

1 0 During the process to decide whether a particular clinical parameter should be used for the 
next stratification, our objectives were twofold: (1) identification of homogeneous prognosis 
patterns; and/or (2) improved prognosis in the subsets. There is a subtle balance between 
these two objectives because smaller groups will likely lead to uniform patterns within the 
group but have increasingly limited predictive power. With the exception of the BRCA1 

15 subset, each group in our stratification contained 50 or more samples. 

The first layer of stratification was based on the estrogen receptor level. We and 
others previously observed that estrogen receptor expression has a dominant effect on overall 
gene expression in breast cancer as seen in hierarchical clustering, van 't Veer et al., Nature 
415:530 (2002); Perou et al, Nature 406:747 (2000); Gruvberger et al., Cancer Res. 61 :5979 

20 (2001). In our previous analysis up to 2500 genes are significantly correlated with ER 
expression levels in tumor, van 't Veer et al, Nature 415:530 (2002). According to the 
threshold defined previously (van de Vijver et al., N. Engl. J. Med. 347:1999 (2002)), 
samples were first divided into two groups according to the estrogen receptor level as 
measured by the oligo probe (accession number: NM_000125) on the array; samples with 

25 log(ratio) > -0.65 belong to the ER+ group, and the rest belong to ER~ group). This resulted 
in 239 samples in the ER+ group and 72 samples in the ER - group. 

In the ER+ branch we observed that when displaying ER expression level as a 
function of age, at least two subgroups appear to exist. (In general, any bimodality in the 
clinical data is useful.) We therefore decided to stratify the tumors according this bimodality 
30 (see FIG. 2). The group of ER+ patients having a high ER/AGE ratio was designated the 
"ER/AGE high" group (83 samples), and the remaining group of patients was designated 
"ER/AGE low" group (156 samples). 
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Within the "ER/age high" group, we identified a group of prognosis reporter genes 
that highly correlated with the outcome (see Table 3). Moreover, the expression of these 
genes appeared to be very homogeneous, as indicated by high similarity in expression among 
those genes. See FIG 2A. Leave-one-out cross validation including reporter selection 
yielded an odds ratio of 14.6 (95%CI: 4.7-45.4) and 5 year odds ratio of 24.0 (95%CI: 6.0- 
95.5). Examination of those reporter genes reveals they are mostly the cell cycle genes which 
are highly expressed in the poor outcome tumors. It is worth noting that even though this 
group includes LN+ and LN- individuals, and mixed treatment, the incidence of distant 
metastases is predicted by a biologically uniform set of genes, possibly indicating that 
proliferation is the prime driving force for disease progression. Also even though variation in 
these genes is observed in other tumor subgroups this is generally not correlated with 
outcome in those settings (see below). 

In the "ER/age low" group, no predictive pattern was found in the whole group; thus, 
the samples were further stratified into LN- (81 samples, referred to as "ER/age low LN~") 
and LN+ (75 samples, referred to as "ER/age low LN+") group. 

Within the "ER/age low LN - " group, a group of genes was identified that was 
uniformly co-regulated, and which correlated with the outcome. Leave-one-out cross- 
validation (including feature selection) yielded an odds ratio of 21.4 (95% CI: 6.0-76.5) and 5 
year odds ratio of 29.2 (95% CI: 6.7-126.3). This group of genes is also enriched for 
individual biological functions (see below). 

For the "ER/age low LN+" subset, an informative set of genes {see Table 4) was 
obtained after exclusion of several samples from older individuals having low ER levels. 
These samples are indicated in FIG. 2A as those lying below the dashed line (approximated 
as ER < 0.1*(age-50). 56 samples remained after the exclusion. This sample set allowed the 
identification of a group of genes with a highly homogeneous pattern that is useful for 
prognosis (overall odds ratio: 11.4 (2.5-50.9), 5 year odds ratio: 15.0 (2.6-87.6)). This 
suggests again that ER vs. age is an important combination for stratifying breast cancer 
patients. The reporter genes involved in this classifier also correlated with the clinical 
measure of the degree of lymphocytic infiltration (data not shown). The prediction in this 
group is not as strong as other positive groups, which may indicate the primary tumor carries 
weaker information about the metastases for this group of patients, and the metastases may be 
started from or influenced by tumors already in lymph nodes. 



-117- 





WO 2006/084272 



PCT/LS2006/004280 



In the ER" branch, because a portion of the samples are "BRCAl-l\ke" it is natural to 
divide the samples into "BRCAl-like" and "sporadic like". To perform the classification, the 
BRCA //sporadic tumor type classifier described in Roberts et al, "Diagnosis and Prognosis 
of Breast Cancer Patients," International Publication No. WO 02/103320, which is hereby 
5 incorporated by reference in its entirety, to segregate the ER~ cohort samples. 52 out of the 
72 ER~ samples were found to be "sporadic like" and 20 were found to be "MG4i-like". 
Interestingly, the "sporadic like" group is enriched for erbb2 mutations (data not shown). 

Within the "ER~, sporadic" group, no homogeneous prognosis pattern was identified; 
however, 20 genes were identified that are highly predictive of the tumor outcome (see Table 

10 2). Leave-one-out cross-validation including feature selection yielded an odds ratio of 7.4 
(95% CI 2.2-25.0) and 5 year odds ratio 9.6 (2.5 - 37.6). This result represents a significant 
improvement in prognosis compared to the previously-identified 70 gene prognosis classifier 
(see Roberts et al, International Publication No. WO 02/103320; van 't Veer et al, Nature 
415:530 (2002)) which has no within-group prognostic power for the ER - patient subset. 

1 5 The fact that 20 genes predict outcome and that there is no homogeneous (and apparent 

biological) pattern in this group probably indicates multiple mechanisms of metastasis in this 
group. Gene annotation indicates that genes included may be involved in invasion, energy 
metabolism and other functions. 



20 from our previous study to increase the number of samples. Those 14 extra samples also 

satisfy our selection criteria: ER negative and age less than 55 years. The leave-one-out cross 
validation process identified 10 genes that are predictive of final outcomes. The overall odds 
ratio is 14.7 (95% CI: 2.3-92.1) and the 5 year odds ratio is 24.0 (95% CI: 2.0-282.7). 

Because no homogeneous gene expression patterns were found in ER branch, the 
25 predictive power of those genes was further validated. One means of further validation was 
to review the different classifier gene sets for biological interpretations and to identify genes 
within each classifier that gave indications as to the origins of the tumors. 

The "ER+, ER/AGE high" group yielded a classifier highly enriched for cell cycle 
genes with both Gl/S and G2/M phases represented. In this group, over-expression of 46 of 
30 the 50 genes is associated with disease progression including all the known cell cycle genes. 
This is consistent with rapid growth being the determinant of metastatic potential. Four 



For the "ER - , BRCAl-WksT group, we added 14 BRCA1 mutation carrier samples 
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genes in this classifier are anti-correlated with outcome and cell cycle. One of these genes 
encodes follistatin, which binds to and inhibits activin and other members of the TGFP family 
(Lin et al, Reproduction 126:133 (2003)), the members of which have many functions, 
including growth stimulation. Tumor grade also accurately predicts metastatic potential in 
> this group (overall odds ratio: 5.9, 95% CI: 2.0-18.0, 5 year odds ratio: 12.5, 95% CI: 2.6- 
59.3) and is also correlated with the expression level of these genes, which is consistent with 
rate of growth being the primary determinant of disease progression. This set of genes has a 
significantly lower correlation with outcome in the other patient subsets, even though 
coordinate and similarly variable expression is seen. For example, many tumors in the "ER , 
0 sporadic" group have high cell cycle and low FST expression, but the expression of these 
genes in these groups is minimally correlated with outcome, indicating that growth is not the 
primary determinant of outcome here (see FIGS. 8A and 8B). 

The ER+, ER/AGE low, LN~~ group yielded a classifier rich in both genes for 
glycolytic enzymes (12 of 56) and genes induced by hypoxia and/or angiogenesis (14 of 56) 
15 with 5 genes falling into both categories. These genes are positively correlated with poor 
outcome, implying that energy metabolism (glycolysis), angiogenesis and adaptation to 
hypoxia are critical pathways in this subgroup of tumors. None of these genes appear in the 
classifiers for the other patient subsets, and there is a much reduced predictive value of these 
genes in the other tumors, even though coordinate and similarly variable expression is seen 
20 (see FIG. 8C and 8D). 

The implication of the above analyses is that certain well known functions (growth, 
angiogenesis, energy metabolism) are important in certain tumor types and not in others, and 
therefore therapies that target these functions will be likely be similarly effective in some 
tumor subgroups and not in others. For example therapies that target cell cycle progression, 
25 such as taxol or the vinca alkaloids, may be optimally effective in the ER+, ER/AGE high 
group, where overexpression of cell cycle genes predominates in the classifier. In contrast, 
tumor subgroups in which variation in cell cycle expression is not correlated with outcome 
may be less sensitive to taxol or the vinca alkaloids. 

The "comprehensive prognosis" approach significantly improves the prediction error 
30 rate when compared with 70 gene classifier (Table 7). To make the comparison fair, we 
listed two sets of results from the 70 gene classifier. The first results from the use of the 
same threshold applied to all the patient subsets (threshold previously optimized for false 
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negative rate); the second one results from the use of a threshold optimized for each patient 
subset (optimized for average error rate). The comprehensive approach lowered the error rate 
by at least 6%. 

Table 7. Average error rate for the patient subset approach compared with the previously- 
5 described 70 gene classifier. 



Prognosis method 


over all error rate 


5 year error rate 


70 gene, fix thresh 


30.90% 


25.70% 


70 gene, opt thresh 


28.60% 


27.60% 


Comprehensive 


21.50% 


19.30% 



Fix thresh: use of a fixed threshold in the classifier as previously determined. 
Opt threshold: use of a threshold optimized for each sub-group. For the "ER/Age low, LN+" 
subgroup, 56 samples used for developing the classifier were included here, resulted in 306 
samples in total. 



6.2. EXAMPLE 2: IDENTIFICATION OF CHEMOTHERAPY RESPONSE 
SUBPOPULATION IN BREAST CANCER PATIENTS 

Predicting responders to a treatment is extremely important in daily clinical therapies, 
15 in order to select the right medicine. It is also crucial for pharmaceutical companies running 
clinical trials on new medicines to target the right population for efficacy. This Example 
shows that, by combining clinical parameters with gene expression prediction, a 
subpopulation of patients with characteristics of "ER/age low" and a "poor signature" has an 
enhanced response to chemotherapy treatment. On the other hand, patients with a poor 
20 signature but do not respond to chemotherapy are potential target for clinical trials of new 
anti-cancer medicines. 

Introduction 

Gene expression can be used to predict breast cancer outcome. A "70 gene" classifier 
(see Table 8) that can be used to identify patients with good outcome for avoiding over- 
25 treatment of chemotherapy was demonstrated (van 't Veer LJ, Dai H, van de Vijver MJ, et al., 
Gene expression profiling predicts clinical outcome of breast cancer, Nature 2002; 415: 530- 
536). This classifier has been verified by a large cohort of samples (van de Vijver MJ, He 
YD, van't Veer LJ, et al., A gene-expression signature as a predictor of survival in breast 
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cancer, N Engl J Med 2002; 347: 1999-2009). The result also implies that the treatment 
effect will be more significant within the "predicted poor" group since patients in the 
"predicted good" group are already very good, and may not improve further by additional 
treatment. 

5 Recently, it has been demonstrated that ER+ patients can be further divided into 

subpopulations. The cell proliferation signature has a very strong prognostic power in a 
group of patients characterized by a high estrogen receptor levels relative to their ages 
(termed "ER/age high" group), but reduced predictive power in the "ER/age low" group and 
no predictive power in the ER- patients (U.S. Provisional Patent Application No. 60/550,810 

1 0 by Dai et al., filed March 5, 2004, attorney docket no. 930 1 -229-888, which is incorporated 
by reference in its entirety). 

Data set 

A cohort of 3 11 breast cancer samples were used (van 't Veer LJ, Dai H, van de 
Vijver MJ, et al., Gene expression profiling predicts clinical outcome of breast cancer, Nature 
15 2002; 415: 530-536; van de Vijver MJ, He YD, van't Veer LJ, et al., A gene-expression 
signature as a predictor of survival in breast cancer, N Engl J Med 2002; 347: 1999-2009). 
Those samples were from patients younger than 55 years of age at the time of cancer 
diagnosis. Among them, 110 patients received chemotherapy. 

Results 

20 The following section shows that by combining the microarray prediction with the 

patient stratification based on clinical parameters, a subpopulation of breast cancer patients 
with enhanced chemotherapy treatment response can be identified. 

The treatment response was measured by the improvement in the metastasis free 
probability at 10 years after the diagnosis, between the treated patients and the untreated 
25 patients. 

Chemotherapy has limited effect in breast cancer patients less than 55 years of age. 
Shown in Figure 9 is the metastasis free probability as a function of time after diagnosis 
(Kaplan-Meier plot) for the entire set of patients. Patients were divided into two groups 
according to whether or not they received chemotherapy. Within the entire set of 3 1 1 
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patients younger than 55 years, 110 received chemotherapy. The treatment effect is very 
small, with the difference in 10 year metastasis free probability being only -3%. 

The "70 gene" prognostic classifier enhances the treatment effect. The purpose of the 
"70 gene" prognostic classifier (Table 8) was to save patients from unnecessary 

5 chemotherapy if they have a "good signature". In other words, there may not be further 
benefit to patients in a good prognosis patient group if they potentially have a very good 
outcome. Hence, treatment should be given to the "predicted poor" group since they could 
potentially benefit from it. Results in Figure 10 validate the idea. Figure 10a displays the 
treatment effect of the "predicted good" group. The group was selected by high correlation to 

1 0 the "average expression of good outcome patients" as determined by 70 gene marker set 

(column CI of Table 1) with correlation > 0.5. There were 84 patients within this group, and 
44 received chemotherapy. The treatment showed no positive effect in reducing the 
metastasis rate. 

However, the patients with "poor signature" demonstrate an improved treatment 
1 5 response (Figure 1 Ob) as compared to Figure 9. There are 226 patients with "poor signature" 
(correlation < 0.5), and 81 received chemotherapy. The 10 year metastasis free probability 
for those received chemotherapy is 9% higher than those did not. 

In this example, a threshold of 0.5 in correlation was used to select patients with 
"poor signature". This threshold is higher than the threshold of 0.4 that was used previously 

20 (van 't Veer LJ, Dai H, van de Vijver MJ, et al., Gene expression profiling predicts clinical 
outcome of breast cancer, Nature 2002; 415: 530-536; van de Vijver MJ, He YD, van't Veer 
LJ, et al., A gene-expression signature as a predictor of survival in breast cancer, N Engl J 
Med 2002; 347: 1999-2009), for the purpose of keeping as many patients in the "predicted 
poor" group as possible for further stratification. The results were very similar if the original 

25 threshold of 0.4 was used. 

Limiting to ER+ patients gains extra treatment response . Breast cancer patients with 
positive estrogen receptor status (ER+) have a distinctively different gene expression pattern 
than those with negative status(ER-). The "70 gene" classifier has a good prognostic power 
in the ER+ patients, but little prognostic power in the ER- patients (almost all the ER- 
30 patients were predicted to have a poor outcome). 
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The treatment effect is also very different for ER- patients than for ER+ patients with 
"poor signature". As shown in Figure 11a, there is no difference in metastasis rate between 
the treated and untreated patients in the ER- group. 

The treatment effect for ER+ patients with a "poor signature" (Figure 1 lb) is further 
5 increased when compared to Figure 10b. There are 155 patients in this group, and 60 
received chemotherapy. The metastasis free probability at 10 years for patients with 
treatment is 14% higher than those without. 

"ER/age high" group is ideal for chemotherapy treatment. Recently, it was discovered 
that the ER+ patients can be further divided into two groups according to their ER levels 

10 relative to the age at diagnosis. As shown in Figure 12, patients with relatively high ER level 
to their age are termed "ER/AGE high" group, and patients with relatively low ER level to 
their age are termed as "ER/AGE low" group. The "ER/AGE high" group has a relatively 
poor outcome compared to the "ER/AGE low" group, but the outcome in the "ER/AGE high" 
group can be accurately predicted by the cell proliferation genes. The same set of genes has a 

1 5 reduced prognostic power in the "ER/AGE low" group. 

As shown in Figure 13, chemotherapy has little effect in the patients of "ER/AGE 
high" group with a "poor signature". Excluding this group resulted in a further improved 
effect in the "ER/AGE low" group. In the "ER/AGE low" group, there are 101 patients with 
"poor signature", and 39 received chemotherapy. The difference in metastasis free 
20 probability at 10 years between the treated and untreated group is 20%. 

In summary, by progressively excluding "non-responsive" patient groups as defined 
by prognosis signature and clinical parameters (ER and age), it is possible to improve the 
treatment effect (as reported by the increased metastasis free probability at 10 years after 
diagnosis) from 3% to 20% (Table 9). Roughly, for patients younger than 55 years, there are 
25 about 1/3 of patients that do not need a treatment ("good signature"), about 1/3 patients do 
not respond to the chemotherapy (ER- and "ER/AGE high") and about 1/3 patients are 
"responders" that should be treated with chemotherapy ("ER/AGE low"). 



Table 9: Treatment effect in various patient groups 







Patients with 


ER+ & "poor 
signature" 


ER+, ER/AGE 




All patients 


"poor 
signature" 


high & "poor 
signature" 
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Difference in 










10 year 










metastasis 










free 


3% 


9% 


14% 


20% 


probability 










with/without 
chemo 











7. REFERENCES CITED 



All references cited herein are incorporated herein by reference in their entirety and 
for all purposes to the same extent as if each individual publication or patent or patent 
5 application was specifically and individually indicated to be incorporated by reference in its 
entirety for all purposes. 

Many modifications and variations of the present invention can be made without 
departing from its spirit and scope, as will be apparent to those skilled in the art. The specific 
embodiments described herein are offered by way of example only, and the invention is to be 
1 0 limited only by the terms of the appended claims along with the full scope of equivalents to 
which such claims are entitled. 
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WHAT IS CLAIMED IS : 

1 . A method for predicting responsiveness of a breast cancer patient to chemotherapy 
based on one or more of the following (i) a cellular constituent profile comprising 
measurements of a plurality of cellular constituents in a sample derived from said patient, (ii) 
5 an ER level, said ER level being a level of an estrogen receptor transcript or estrogen receptor 
protein in a sample derived from said patient, wherein ER + designates a high ER level and 
ER" designates a low ER level, and (iii) an ER/AGE, said ER/AGE being a metric of said ER 
level relative to the age of said patient, said method comprising predicting said patient to 
exhibit 

0 (a) enhanced response to chemotherapy as compared to patients in the general 

population of breast cancer patients if said cellular constituent profile is a poor prognosis 
profile, and said ER level is ER + , and said ER/AGE is low; or 



population of breast cancer patients if (i) said ER level is ER", or (ii) said cellular constituent 
1 5 profile is a poor prognosis profile, and said ER level is ER + , and said ER/AGE is high; 

wherein said cellular constituent profile is classified as a poor prognosis profile if said 
cellular constituent profile has a low similarity to a good prognosis template or has a high 
similarity to a poor prognosis template, said good prognosis template comprising 
measurements of said plurality of cellular constituents representative of levels of said cellular 

20 constituents in a plurality of good outcome patients and said poor prognosis template 

comprising measurements of said plurality of cellular constituents representative of levels of 
said cellular constituents in a plurality of poor outcome patients, wherein a good outcome 
patient is a breast cancer patient who has non-occurrence of metastases within a first period 
of time after initial diagnosis and a poor outcome patient is a patient who has occurrence of 

25 metastases within a second period of time after initial diagnosis. 

2. The method of claim 1, said method comprising predicting said patient to exhibit 
enhanced response to chemotherapy as compared to patients in the general population of 
breast cancer patients if said cellular constituent profile is a poor prognosis profile, and said 
ER level is ER + , and said ER/AGE is low. 



(b) reduced response to chemotherapy as compared to patients in the general 
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3. The method of claim 1, said method comprising predicting said patient to exhibit 
reduced response to chemotherapy as compared to patients in the general population of breast 
cancer patients if said ER level is ER". 

4. The method of claim 1, said method comprising predicting said patient to exhibit 
reduced response to chemotherapy as compared to patients in the general population of breast 
cancer patients if said cellular constituent profile is a poor prognosis profile, and said ER 
level is ER + , and said ER/AGE is high. 

5. The method of claim 1, further comprising determining said cellular constituent 
profile, said ER level, and/or, said ER/AGE. 

6. The method of claim 5, wherein said cellular constituent profile comprises 
measurements of a plurality of transcripts in a sample derived from said patient, wherein said 
good prognosis template comprises measurements of said plurality of transcripts 
representative of expression levels of said transcripts in said plurality of good outcome 
patients, and wherein said poor prognosis template comprises measurements of said plurality 
of transcripts representative of expression levels of said transcripts in said plurality of poor 
outcome patients. 

7. The method of claim 6, wherein said cellular constituent profile is a differential 
expression profile comprising differential measurements of said plurality of transcripts in said 
sample derived from said patient versus measurements of said plurality of transcripts in a 
control sample. 

8. The method of claim 1, wherein said cellular constituent profile comprises 
measurements of a plurality of protein species in a sample derived from said patient, wherein 
said good prognosis template comprises measurements of said plurality of protein species 
representative of levels of said protein species in said plurality of good outcome patients, and 
wherein said poor prognosis template comprises measurements of said plurality of protein 
species representative of levels of said protein species in said plurality of poor outcome 



9. The method of claim 7, wherein measurement of each said transcript in said good 
prognosis template is an average of expression levels of said transcript in said plurality of 
30 good outcome patients. 
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10. The method of claim 9, wherein similarity of said cellular constituent profile to 
said good prognosis template is represented by a correlation coefficient between said cellular 
constituent profile and said good prognosis template, wherein said correlation coefficient 
greater than a correlation threshold indicates a high similarity and said correlation coefficient 

5 equal to or less than said correlation threshold indicates a low similarity. 

1 1 . The method of claim 9, wherein similarity of said cellular constituent profile to 
said good prognosis template is represented by a distance between said cellular constituent 
profile and said good prognosis template, wherein said distance less than a given value 
indicates a high similarity and said distance equal to or greater than said given value indicates 

10 a low similarity. 

12. The method of claim 10, wherein said correlation threshold is 0.5. 

13. The method of claim 12, wherein said ER level is determined by measuring an 
expression level of a gene encoding said estrogen receptor in said patient relative to 
expression level of said gene in said control sample, and wherein said ER level is classified 

15 as ER + if logl0(ratio) of said expression level is greater than -0.65, and wherein said ER level 
is classified as ER' if logl0(ratio) of said expression level is equal to or less than -0.65. 

14. The method of claim 13, wherein said gene encoding said estrogen receptor is the 
estrogen receptor a gene. 

15. The method of claim 14, wherein said ER/AGE is classified as high if said ER 
20 level as measured by loglO(ratio) is greater than c (AGE - d), and wherein said ER/AGE is 

classified as low if said ER level is equal to or less than c ( AGE - d), wherein c is a 
coefficient, AGE is the age of said patient, and d is an age threshold. 

16. The method of claim 15, wherein said estrogen receptor level is measured by an 
oligonucleotide probe that detects a transcript corresponding to the gene having accession 

25 number NM_000125, wherein said control sample is a pool of breast cancer cells of different 
patients, and wherein c = 0. 1 and d = 42.5. 

17. The method of claim 16, wherein said control sample is generated by pooling 
together cDNAs of said plurality of transcripts from a plurality of breast cancer patients. 
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18. The method of claim 16, whereulWd control sample is generated by pooling 
together synthesized cDNAs of said plurality of transcripts and said transcript of said gene 
encoding said estrogen receptor. 

19. The method of claim 17, wherein said plurality of transcripts consists of 
transcripts corresponding to at least a portion of the set of genes listed in Table 8. 

20. The method of claim 19, wherein said plurality of transcripts consists of all 
transcripts corresponding to genes listed in Table 8. 

21. The method of claim 20, wherein said differential measurements is selected from 
the group consisting of xdev, log(ratio), error- weighted log(ratio), and mean subtracted 
log(intensity). 

22. The method of claim 21, wherein said first period is 10 years and said second 
period is 10 years. 

23. The method of claim 22, wherein said patient is under the age of 55, and wherein 
each said good outcome patient and each said poor outcome patient are under the age of 55 at 
time of diagnosis of breast cancer. 

24. A method for selecting a patient for enrollment in a clinical trial of a drug for 
treating breast cancer based on one or more of the following (i) a cellular constituent profile 
comprising measurements of a plurality of cellular constituents in a sample derived from said 
patient, (ii) an ER level, said ER level being a level of an estrogen receptor transcript or 
estrogen receptor protein in a sample derived from said patient, wherein ER + designates a 
high ER level and ER" designates a low ER level, and (iii) an ER/AGE, said ER/AGR being a 
metric of said ER level relative to the age of said patient, said method comprising selecting a 
patient for inclusion in said clinical trial if (a) said ER level is ER"; or (b) said cellular 
constituent profile is a poor prognosis profile, said ER level is ER + , and said ER/AGE is high; 

wherein said cellular constituent profile is classified as a poor prognosis profile if said 
cellular constituent profile has a low similarity to a good prognosis template or has a high 
similarity to a poor prognosis template, said good prognosis template comprising 
measurements of said plurality of cellular constituents representative of levels of said cellular 
constituents in a plurality of good outcome patients and said poor prognosis template 
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comprising measurements of said plurality of cellular constituents representative of levels of 
said cellular constituents in a plurality of poor outcome patients, wherein a good outcome 
patient is a breast cancer patient who has non-occurrence of metastases within a first period 
of time after initial diagnosis and a poor outcome patient is a breast cancer patient who has 
5 occurrence of metastases within a second period of time after initial diagnosis. 

25. The method of claim 24, said method comprising selecting a patient for inclusion 
in said clinical trial if said ER level is ER". 

26. The method of claim 24, said method comprising selecting a patient for inclusion 
in said clinical trial if said cellular constituent profile is a poor prognosis profile, said ER 

1 0 level is ER + , and said ER/AGE is high. 

27. The method of claim 24, further comprising determining said cellular constituent 
profile, said ER level, and/or, said ER/AGE. 

28. The method of claim 27, wherein said cellular constituent profile comprises 
measurements of a plurality of transcripts in a sample derived from said patient, wherein said 

1 5 good prognosis template comprises measurements of said plurality of transcripts 

representative of expression levels of said transcripts in said plurality of good outcome 
patients, and wherein said poor prognosis template comprises measurements of said plurality 
of transcripts representative of expression levels of said transcripts in said plurality of poor 
outcome patients. 

20 29. The method of claim 28, wherein said cellular constituent profile is a differential 

expression profile comprising differential measurements of said plurality of transcripts in said 
sample derived from said patient versus measurements of said plurality of transcripts in a 
control sample. 

30. The method of claim 24, wherein said cellular constituent profile comprises 
25 measurements of a plurality of protein species in a sample derived from said patient, wherein 
said good prognosis template comprises measurements of said plurality of protein species 
representative of levels of said protein species in said plurality of good outcome patients, and 
wherein said poor prognosis template comprises measurements of said plurality of protein 
species representative of levels of said protein species in said plurality of poor outcome 
30 patients. 



-129- 



WO 2006/084272 



PCT/US2006/00428U 



31. The method of claim 29, wherein measurement of each said transcript in said good 
prognosis template is an average of expression levels of said transcript in said plurality of 
good outcome patients. 

32. The method of claim 31, wherein similarity of said cellular constituent profile to 
5 said good prognosis template is represented by a correlation coefficient between said cellular 

constituent profile and said good prognosis template, wherein said correlation coefficient 
greater than a correlation threshold indicates a high similarity and said correlation coefficient 
equal to or less than said correlation threshold indicates a low similarity. 

33. The method of claim 31, wherein similarity of said cellular constituent profile to 
1 0 said good prognosis template is represented by a distance between said cellular constituent 

profile and said good prognosis template, wherein said distance less than a given value 
indicates a high similarity and said distance greater than said given value indicates a low 
similarity. 

34. The method of claim 32, wherein said correlation threshold is 0.5. 

15 35. The method of claim 34, wherein said ER level is determined by measuring an 

expression level of a gene encoding said estrogen receptor in said patient relative to 
expression level of said gene in said control sample, and wherein said ER level is classified 
as ER + if logl0(ratio) of said expression level is greater than -0.65, and wherein said ER level 
is classified as ER" if logl0(ratio) of said expression level is equal to or less than -0.65. 

20 36. The method of claim 35, wherein said gene encoding said estrogen receptor is the 

estrogen receptor a gene. 

37. The method of claim 36, wherein said ER/AGE is classified as high if said ER 
level is greater than c-(AGE - d), and wherein said ER/AGE is classified as low if said ER 
level is equal to or less than c ( AGE - d), wherein e is a coefficient, AGE is the age of said 

25 patient, and d is an age threshold. 

38. The method of claim 37, wherein said estrogen receptor level is measured by an 
oligonucleotide probe that detects a transcript corresponding to the gene having accession 
number NM_000125, wherein said control sample is a pool of breast cancer cells, and 
wherein c = 0.1 and d = 42.5. 



-130- 



WO 2006/084272 



PCT/US2006/004280 



39. The method of claim 38, wherein said control sample is generated by pooling 
together cDNAs of said plurality of transcripts from a plurality of breast cancer patients. 

40. The method of claim 38, wherein said control sample is generated by pooling 
together synthesized cDNAs of said plurality of transcripts and said transcript corresponding 

5 to said gene encoding said estrogen receptor. 

41. The method of claim 39, wherein said plurality of transcripts consists of 
, transcripts corresponding to at least a portion of the set of genes listed in Table 8. 

42. The method of claim 41, wherein said plurality of genes consists of all transcripts 
corresponding to genes listed in Table 8. 

10 43 . The method of claim 42, wherein said differential measurements is selected from 

the group consisting of xdev, log(ratio), error- weighted log(ratio), and mean subtracted 
log(intensity). 

44. The method of claim 43, wherein said first period is 10 years and said second 
period is 10 years. 

15 45 . The method of claim 44, wherein said patient is under the age of 55, and wherein 

each said good outcome patient and each poor outcome patient are under the age of 55 at time 
of diagnosis of breast cancer. 

46. A method for identifying a breast cancer patient as a good or poor candidate for 
chemotherapy based on one or more of the following (i) a cellular constituent profile 
20 comprising measurements of a plurality of cellular constituents in a sample derived from said 
patient, (ii) an ER level, said ER level being a level of an estrogen receptor transcript or 
estrogen receptor protein in a sample derived from said patient, wherein ER + designates a 
high ER level and ER" designates a low ER level, and (iii) an ER/AGE, said ER/AGE being a 
metric of said ER level relative to the age of said patient, said method comprising 

25 (a) determining whether said cellular constituent profile is a poor prognosis profile, 

said ER level is ER + , and said ER/AGE is low, wherein if said cellular constituent profile is a 
poor prognosis profile, said ER level is ER + , and said ER/AGE is low, said breast cancer 
patient is identified as a good candidate for chemotherapy; or 
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(b) deteraiining whether said ER level is ER\ wherein if said ER level is ER', said the 
breast cancer patient is identified as a poor candidate for chemotherapy; or 

(c) determining whether said cellular constituent profile is a poor prognosis profile, 
said ER level is ER + , and said ER/AGE is high, wherein if said cellular constituent profile is a 
poor prognosis profile, said ER level is ER + , and said ER/AGE is high, said breast cancer 
patient is identified as a poor candidate for chemotherapy. 

47. The method of claim 46, said method comprising determining whether said 
cellular constituent profile is a poor prognosis profile, said ER level is ER + , and said ER/AGE 
is low, wherein if said cellular constituent profile is a poor prognosis profile, said ER level is 
ER + , and said ER/AGE is low, said breast cancer patient is identified as a good candidate for 
chemotherapy. 

48. The method of claim 46, said method comprising determining whether said ER 
level is ER", wherein if said ER level is ER, said the breast cancer patient is identified as a 
poor candidate for chemotherapy. 

49. The method of claims 46, said method comprising deteraiining whether said 
cellular constituent profile is a poor prognosis profile, said ER level is ER + , and said ER/AGE 
is high, wherein if said cellular constituent profile is a poor prognosis profile, said ER level is 
ER + , and said ER/AGE is high, said breast cancer patient is identified as a poor candidate for 
chemotherapy. 

50. The method of claim 46, further comprising determining said cellular constituent 
profile, said ER level, and/or, said ER/AGE. 

51. The method of claim 50, wherein said cellular constituent profile comprises 
measurements of a plurality of transcripts in a sample derived from said patient, wherein said 
good prognosis template comprises measurements of said plurality of transcripts 
representative of expression levels of said transcripts in said plurality of good outcome 
patients, and wherein said poor prognosis template comprises measurements of said plurality 
of transcripts representative of expression levels of said transcripts in said plurality of poor 
outcome patients. 
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52. The method of claim 51, wherein said cellular constituent profile is a differential 
expression profile comprising differential measurements of said plurality of transcripts in said 
sample derived from said patient versus measurements of said plurality of transcripts in a 
control sample. 

5 53. The method of claim 50, wherein said cellular constituent profile comprises 

measurements of a plurality of protein species in a sample derived from said patient, wherein 
said good prognosis template comprises measurements of said plurality of protein species 
representative of levels of said protein species in said plurality of good outcome patients, and 
wherein said poor prognosis template comprises measurements of said plurality of protein 
10 species representative of levels of said protein species in said plurality of poor outcome 
patients. 

54. The method of claim 52, wherein measurement of each said transcript in said good 
prognosis template is an average of expression levels of said transcript in said plurality of 
good outcome patients. 

15 55. The method of claim 54, wherein similarity of said cellular constituent profile to 

said good prognosis template is represented by a correlation coefficient between said cellular 
constituent profile and said good prognosis template, wherein said correlation coefficient 
greater than a correlation threshold indicates a high similarity and said correlation coefficient 
equal to or less than said correlation threshold indicates a low similarity. 

20 56. The method of claim 55, wherein similarity of said cellular constituent profile to 

said good prognosis template is represented by a distance between said cellular constituent 
profile and said good prognosis template, wherein said distance less than a given value 
indicates a high similarity and said distance greater than said given value indicates a low 
similarity. 

25 57. The method of claim 56, wherein said correlation threshold is 0.5. 

58. The method of claim 57, wherein said ER level is determined by measuring an 
expression level of a gene encoding said estrogen receptor in said patient relative to 
expression level of said gene in said control sample, and wherein said ER level is classified 
as ER + if logl0(ratio) of said expression level is greater than -0.65, and wherein said ER level 
30 is classified as ER" if logl0(ratio) of said expression level is equal to or less than -0.65. 
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59. The method of claim 58, wherein said gene encoding said estrogen receptor is the 
estrogen receptor a gene. 

60. The method of claim 59, wherein said ER/AGE is classified as high if said ER 
level as measured by logl0(ratio) is greater than c(AGE - d), and wherein said ER/AGE is 

5 classified as low if said ER level is equal to or less than c-( AGE - d), wherein c is a 
coefficient, AGE is the age of said patient, and d is an age threshold. 

61. The method of claim 60, wherein said estrogen receptor level is measured by an 
oligonucleotide probe that detects a transcript corresponding to the gene having accession 
number NM_000125, wherein said control sample is a pool of breast cancer cells, and 

10 wherein c = 0.1 and d = 42.5. 

62. The method of claim 61, wherein said control sample is generated by pooling 
together cDNAs of said plurality of transcripts from a plurality of breast cancer patients. 

63. The method of claim 61, wherein said control sample is generated by pooling 
together synthesized cDNAs of said plurality of transcripts and said transcript corresponding 

1 5 to said gene encoding said estrogen receptor. 

64. The method of claim 61, wherein said plurality of transcripts consists of 
transcripts corresponding to at least a portion of the set of genes listed in Table 8. 

65. The method of claim 64, wherein said plurality of genes consists of all transcripts 
corresponding to genes listed in Table 8. 

20 66. The method of claim 65, wherein said differential measurements is selected from 

the group consisting of xdev, log(ratio), error-weighted log(ratio), and mean subtracted 
log(intensity). 

67. The method of claim 66, wherein said first period is 10 years and said second 
period is 10 years. 

25 68. The method of claim 67, wherein said patient is under the age of 55, and wherein 

each said good outcome patient and each poor outcome patient are under the age of 55 at time 
of diagnosis of breast cancer. 
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69. The method of any one of claims 1-68, wherein said chemotherapy is carried out 
using the CMF combination comprising cyclophosphamide, methotrexate, and 5-fluorouracil. 

70. A computer system comprising 
a processor, and 

5 a memory coupled to said processor and encoding one or more programs, 

wherein said one or more programs cause the processor to carry out the method of any one of 
claims 1-68. 

71 . A computer program product for use in conjunction with a computer having a 
processor and a memory connected to the processor, said computer program product 

10 comprising a computer readable storage medium having a computer program mechanism 

encoded thereon, wherein said computer program mechanism may be loaded into the memory 
of said computer and cause said computer to carry out the method of any one of claims 1-68. 

72. A method for predicting responsiveness of a breast cancer patient to 
chemotherapy based on one or more of the following (i) a cellular constituent profile 

1 5 comprising measurements of a plurality of cellular constituents in a sample derived from said 
patient, (ii) an ER level, said ER level being a level of an estrogen receptor transcript or 
estrogen receptor protein in a sample derived from said patient, wherein ER + designates a 
high ER level and ER designates a low ER level, and (iii) an ER/AGE, said ER/AGE being a 
metric of said ER level relative to the age of said patient, said method comprising predicting 

20 said patient to exhibit 

(a) enhanced response to chemotherapy as compared to patients in the general 
population of breast cancer patients if said cellular constituent profile is a poor prognosis 
profile, and said ER level is ER + , and said ER/AGE is low; or 

(b) reduced response to chemotherapy as compared to patients in the general 

25 population of breast cancer patients if (i) said ER level is ER", or (ii) said cellular constituent 
profile is a poor prognosis profile, and said ER level is ER + , and said ER/AGE is high, 
wherein said cellular constituent profile is classified as a good prognosis profile if said 
cellular constituent profile predicts non-occurrence of metastases in said breast cancer patient 
within a predetermined period of time after initial diagnosis, and wherein said cellular 



-135- 



WO 2006/084272 



PCT/US2006/004280 



constituent profile is classified as a poor prognosis profile if said cellular constituent profile 
predicts occurrence of metastasis within said predetermined period of time. 



-136- 



WO 2006/084272 



PCT/US2006/004280 



1/15 



0 Sample 

1 | Process 
22 Performance 



j; Breast tumor sample^ 



ER level 



^ER+ sample^P 



ER vs. Age | 
|/ER/age high j ^ER/oge low^ 



Prognosis 1 



LN status 



Good signature! [Poor signaturel 

/, T " ' ~f~ 

•^irr-arrT} |r// > v v v 



;5/38 (3/36) J 



^metastases,/ ^metastases' 



31/45 (24/35) 



,zrz 

Prognosis 2 Prognosis 3 



Good signature! [Poor signature | iGood signature! [Poor signature 



'6/61 (5/59)' 14/19 (11/14) '8/45 (3/38)/ /9/30 (7/27)/ 
^metastases' ^meta/ases^ ^metastases' ^metastases' 

BER- sample ^ 

| BRCA1? 



^Sporatic" jike^ 
| Prognosis 4~[ | Prognosis 5~| 
[Good signature] [ Poor signature] |Good signature] | Poor signoture] 



1/4/25 (3/23)2 Zl/z/ ^la/WM ry-rnetostasesyj pmeVsta'sesfl 
^metastases^j [^metastases' V////XA V////X/ 

FIG.l 



S J SS 77 

21/27 (19/23) 



^met 



SUBSTITUTE SHEET (RULE 26) 



WO 2006/084272 



PCT7US2006/004280 



2/15 




SUBSTITUTE SHEET (RULE 26) 



WO 2006/084272 




30 40 50 60 
Age 

FIG.2B 



30 



40 50 60 



age-10*log(ER) 

FIG.2C 



Metastases 
samples 




30 40 50 60 



FIG. 2D 

SUBSTITUTE SHEET (RULE 26) 



40 50 60 
age-10*log(ER) 

FIG.2E 



WO 2006/084272 



PCT/US2006/004280 




4/15 



n r 

20 40 60 80 
Number of Reporters 

FIG.3A 





o Group 1 
■ Group 2 




- / " 




-r- r i i 

-1 -0.5 0 0.5 




Correlation with Group 1 


FIG.3B 



m 

_ * 

— ° 

o 

• • 


° Err rate of good 
* Err rate of poor 
- Avg err rate 




- <m 9 m 

— _«■» ■" 

1 ° 1 1 r^H 1 - 



-2 - 



1.5 -1 -0.5 0 0.5 1 1.5 2 
Threshold 



SUBSTT 



ET (RULE 26) 



WO 2006/084272 



PCT/LS2006/004280 




5/15 



40 40 60 80 100 
Number of Reporters 

FIG.4A 




-0.5 0 0.5 
Correlation with Group 1 

FIG.4B 



1 

0.9 
0.8 



0.5 
0.4 
0.3 
0.2 
0.1 
0 







0 Err rate of good 
* Err rate of poor 
- Avg err rate 














I I I I 1 1 1 ~T~ tmm 





-2 



-1.5 -1 -0.5 0 0.5 1 1.5 2 
Threshold 

FIG.4C 

SUBSTITUTE SHEET (RULE 26) 



WO 2006/084272 



PCT/LS2006/00428U 




6/15 



1 

0.9 
0.8 
0.7 
0.6 
0.5 
0.4 
0.3 
0.2 
0.1 
0 



n 1 r 

20 40 60 80 
Number of Reporters 

FIG.5A 



100 




t 

-0.5 0 0.5 
Correlation with Group 1 

FIG.5B 



1 \ J 

: '■- / 


0 Err rate of good 
* Err rate of poor 
— Avg err rate 




1 1 1 ^ i — r^n 



-2 -1.5 -1 -0.5 0 0.5 1 1.5 2 
Threshold 

FIG.5C 

SUBSTITUTE SHEET (RULE 26) 



WO 2006/084272 



PCT/LS2006/004280 




7/15 



20 



n 1 r 

40 60 80 
Number of Reporters 



1 

0.5 
0 

-0.5 



FIG.6A 




o Group 1 
■ Group 2 


- 

/ * oftb 

■ 8«5>op 

" X °& 

I I 1 





-0.5 0 0.5 
Correlation with Group 1 

FIG.6B 





° Err rate of good 
* Err rate of poor 
- Avg err rate 







0.6 
0.5 
0.4 



0.1 
0 



-2 -1.5 -1 -0.5 0 0.5 1 1.5 2 
Threshold 

FIG.6C 

SUBSTITUTE SHEET (RULE 26) 



WO 2006/084272 



PCT/LS2006/004280 





0.5-r 




0.4- 


-22 




O 


0.3- 


Error 


0.2- 




0.1- 




o4 



I 

1 

8 



— r 
20 



40 



— l— 
60 



I — 
80 



8/15 



100 



Number of Reporters 



FIG.7A 


1- 




o Group 1 
■ Group 2 


0.5- 


\X* ° 

° o oo 
^ o 




0- 






-0.5- 
-1- 


1 1 ' 





-0.5 



0.5 



Correlation with Group 1 

FIG.7B 




Err rate of good 
Err rate of poor 
Avq err rate __ 



-0.5 0 0.5 
Threshold 

SUBSTITUTE SH^ET (RULE 26) 



WO 2006/084272 



PCT/US2006/004280 






WO 2006/084272 



PCT/LS2006/004280 



11/15 



KM curve: 




SUBSTITUTE SHEET (RULE 26) 



WO 2006/084272 



PCT/US2006/004280 



12/15 
KM curve: Breast cancer 



1- 



0.8 

La 

1 0.6 

Q_ 

% 0.4 
I 

& 0.2 J 
0 



Chemo 
■ no Chemo 



(correlation > 0.5) 
Chi2=0.57 P=0.45(wt power=0) 



2 4 6 8 
Time (year) 

FIG. 1 0A 



10 12 



KM curve: Breast cancer 



1 

j> 0.8 

JO 

1 0.6 
a. 

J 0.4 
£ 0.2 
0 





— Chemo 

— no Chemo 


(correlation 


< 0.5) 


Chi2=1.91 P=0.167(wt power=0) 
■ i i i i 



4 6 8 10 12 
Time (year) 

FIG.10B 



SUBSTITUTE SHEET (RULE 26) 



WO 2006/084272 



PCT/US2006/004280 



13/15 
KM curve: Breast cancer 




4 6 8 10 12 
Time (year) 

FIG.11A 



KM curve: Breast cancer 




4 6 
Time (year) 

FIG.11B 



8 10 12 



SUBSTITUTE SHEET (RULE 26) 



WO 2006/084272 




I I I I 1 1 

25 30 35 40 45 50 55 
age 

FIG. 12 



SUBSTITUTE SHEET (RULE 26) 



WO 2006/084272 



PCT7US2006/004280 



0.2 

0 



15/15 
KM curve: Breast cancer 





— Chemo 

— no Chemo 






ER/age high 




Chi2=0.65 P=0.419(wt power=0) 
1 i i i i 



4 6 8 
Time (year) 

FIG.13A 



10 12 



KM curve: Breast cancer 



1 

£ 0.8 

'Jo 

1 0.6 





U 0 ODD® 










— Chemo 

— no Chemo 


ER/age low 

Chi2=2.99 P=0.0839(wt power=0) 



2 4 6 8 10 
Time (year) 

FIG.13B 



12 



SUBSTITUTE SHEET (RULE 26) 



WO 2006/084272 



PCT/US2006/004280 



SEQUENCE LISTING 

<110> Rosetta Inpharmatics, LLC 

<120> Methods of Predicting Chemotherapy Responsiveness In 
Breast Cancer Patients 

<130> 9301-249-228 

<140> 
<141> 

<150> 60/650,365 
<151> 2005-02-04 



<160> 425 

<210> 1 
<211> 4946 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> AB032969 
<400> 1 

cagcctcagc ccccagatga agatggggat cacagtgaca aagaagatga acagcctcaa 60 

gtggtggttt taaaaaaggg agacctgtca gttgaagaag tcatgaaaat taaagcagaa 120 

ataaaggctg ccaaagcaga tgaagaacca actccagccg atggaagaat catatatcga 180 

aaaccagtca agcatccctc agatgaaaaa tattcaggtt taacagcaag ctcaaaaaag 240 

aagaagccaa atgaagatga agtaaatcag gactcggtca aaaagaactc acaaaaacaa 300 

attaaaaata gtagcctcct ttcttttgac aacgaagatg aaaatgagta agtgtaaata 360 

ttttgaattt agtctacttt gaaagtatat ggagtgttca ttaaaatcac attttttcct 420 

attataaaga tactacaagt tctttataga aagtttagga aatagagaaa aaaatttaat 480 

aaactacatc tattcatcaa tacccctctg acttaaaatg ccaactctat agaaattagc 540 

tagtattaac attttgttat ttcccttgtg tggttgtata tatatgtaaa ttatattttt 600 

aagcaaaata cattttttgt gtgtaaacaa aattttataa atacaactgt attgcaaatg 660 

ttctttgtcc tgcttctcac ttgacattgc attatgagta ttcttccagg tcagtaaatt 720 

tcaaaaacct gacattaata gctacagata atttcataaa catctcattg tatctttttc 780 

attagcaata gctccacttt gggtggggga gatgataatg tgccttgtta aaaatacctc 840 

cccaactcct gctaagggtg gccatgagac tcagctctgg caagttaaga aatacaggtg 900 

gaattctgct tgataaagct gctgggtttt ttgttacaaa aggacagact tggcaaacat 960 

gagcctttgc tcttatcttt tcatcctact tggagtgcag agataaaacc tgagtaccag 1020 

agccactttt aggcataagg aaggcagcca tgtgctttgg gtcatgttag taaaaagact 1080 

cagagcttgg ctccttgctg acatgcctgg aggagctgct acaccagctt ggattgctga 1140 

cctctgactt cttggtagtg agaagaataa acactgtgct taattaggcc ttggtcaggt 1200 

ttcttttata tgcagccaaa tgcagtccta agtaatacaa taaataactg gtcaaactgt 1260 

tactggtgga gggtgtccag gttcttggca ttttggacaa ataattgaac aaaacgcaca 1320 

aagcaatgaa tatcctctag aggtttgcca ttggttactt ggcgtacacc ctgtgtaaat 1380 

gaagtagtgg cccgtgacct gtctgattgg tgcagaaagt gaccaatcag aggctgaagt 1440 

gaagttacaa agttatactc ctgtgtaaat gaggacttgg cctatgacca gtctgattgg 1500 

ttgcaggagg ggaccaatca gaggcacttt catttttcat ctgcaatgca gaaaaggcaa 1560 

ggggattgca aagggagtag cctctgatcc ttttgttact taggtatgga gaggtggggt 1620 

tttccttttg attcagttct aggaagtcaa tgtgaatcag ccttaggttc cctgtctcca 1680 

gaccctattc tcctgcctca ttttccccct gagagacgtg atcctcgtaa atctttatgg 1740 

gaggctgaga gactgagggt ctttcttctg taactgcttc atgctaactt gggacacagt 1800 

ccctacctat tggagatcac gtaactctca ccctgctttg tctaggggag acagggtagc 18 60 

ttcttgatgg ccggtggtgt cttctcctga aactggctag aaatcttgtc acatgatcat 1920 

ctaacttggt ggtctctagg caaaaggaaa tggatttggt taaaagattt aacagatatg 1980 
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gtccaaaaac caaggcaaat ataatcatta ataatgggct ggccaaggga gggagccatg 2040 

aaacccaact tagtgccctt taggtgcccc agctgttgtc atattttaga ggcccagtca 2100 

gctagttttc. aggtggtgtc ccttactaat cctgattggt tgacatcaaa acagcattct 2160 

tcttctagga aaatacataa gccacctgtt tcagcagtta ggagatctag tccccttcga 2220 

ttttgcaaag cgaccactgc caaggagcct atccgaattt gtaaggtgac aatactttga 2280 

gcaatgttat ccaggctttc cataaaatcc ttggacaagc gttggtaata ggatagggaa 2340 

gttgcaatcc cgctaactcc cattcctacc tctgctgtta ttcctagccg ttgtgtctgg 2400 

tggttgcagt taaaggtata atgagggatt ggttgttggg agctatatta atttagggac 2460 

atacaatatt tctgtctcca gtctaccact tccaccaaag acaaatcaca gcagaaccga 2520 

cctaacttca aaataaactg cagtcccata tactgggcct gattacccac acaaagtgca 2580 

acaagaatca ttgtccatat agactctcct agattggctt tgctagaaca tttcacaagg 2640 

ccatttcagt caaagtcctg agaaagtaac cggtttcaat tgtgccctat tacaaaagaa 2700 

aacgtggtta ttaactttat acagacaaat gccatgaatt aagaatattc ataaatagtt 27 60 

tacaaattct ggagaaatta gaatactcaa tacacttaaa gtgtatttca aggctataaa 2820 

tagctcaaaa taaaaagatt attcagactc tgaaaaaaca aaaagaagta gcaatatttc 2880 

aaacaacaaa agccatacaa attatttcag tcttccatta gttcatttca gtccatgtaa 2940 

tcaactcctg ctctacttca tattcatctt tatgaacaca tcagcctttc aattagtgcc 3000 

ttggaagttt tctgtctaat ccaatggcac actctccaaa gttaccagaa acctgcattc 3060 

aagagttctt ttcatgaact ccaaagaagt aagccttgga ctgtagctga ttataagtca 3120 

cttttttttt ttgagaagga tcaaagcaaa acatcaatta tggatgacaa aagtcttaag 3180 

acagccataa agacacagtt gacaaatgtg gctatttctg tggcttacaa caatttaaca 324 0 

taatcattac aacatatatt aagacatatc agaattttag aactctcata caatcctgga 3300 

acacatatta acaacaaatc tctatcagta taacccaaag gaagctaaac accacctcac 3360 

acttgacaat gtttcctgta taattcaaac attacaaata agcctaatat aagcctaata 3420 

tgtcactctt gaacttcagg aagcctaata tccaaaaagt tagtttaagg tcaaaagttt 3480 

ttgaattaac ttttttccat tagtatggtc atatctttct tactaatttg taagttatgt 3540 

aatttatcaa tttttttttg ttgttctgtt tcccaacctc tatgtcagat aaagaatcac 3600 

ccaggccaga cacagtggct catgcttgtg gtcccagcac tttgggaagc caaggtggga 3660 

gaattgcttg aagccaggaa tctgagccca gcctgggcga caaagcaata cccctatctc 3720 

tacaaaaaat aaaaaatagc caggtgtggc gacacacacc tgtggtccca gctgctcggg 3780 

aggctgagcg ggaggatggc ttgggcccag gggttcaacg ctgcagtgag ctgtgattgc 384 0 

gccactgcac tccagcctgg gcaacagagt aagaactgtc tcaaaaaaaa taaaaaatag 3900 

aaataaattt taaaaaaaga attacccata ttctctttgt ttttgtttat tcacattaac 3960 

ctttattcta tctggaattt atttgagtat acttttttct caaataatca attgtcctag 4020 

aaccatgtgt ttctcattta tttgaaaggc catctagtga gagatttctc caaatgttgg 4080 

ggtagggaag ggaggggaag cactttaaag tctgagcctt tagaggtgat tcctcaagac 4140 

cctgcttaat cctaacaatt ttcctcatta gtaaaagtca gcccaaactg ggggcttgtt 4200 

aagatcctta ccagccacat ccatctgaaa ttatgaattt caaagtatct tacaaatttg 4260 

gtgccacatt atctttttta agtttgtttt gttttgtttt tttgagacag agtctcgctc 4320 

tgtcacccgg gctggagtgc agtggcgcga tctcagctca ctgcaagctc cgcctcctgg 4380 

gttcacacca ttctcttgcc tcggcctccc aagtatctgg gactgcagtc gcccgccacc 4440 

acgcccggct aatttttttg tatttttaat agagacgggg tttcactttg ttagccagga 4500 

tggtctcaat ctcctgacct catgatccac ctgcctcggc ctcccaaagt gctgggatta 4560 

caggcaggag ccaccgcgcc tgggcctttt tttaagtttt aagtacctat aaagaacact 4 620 

gaaaggtgat gtgtgtggat gagctaggaa gacctgaaat aggctctctc taaattaatc 4 680 

aaattaatcc tgaagccatt ctgcaatact gtctttaatg tatactcact tgttatagaa 4740 

gccagggttt tttcccctaa tttgtatcat tgctatatgt gttattgtac caaactacac 4800 

tgttttaatt gctgtaaatt ttaatatgtc ttagtatctg ggtgtgggaa tcttgaaagc 4860 

atggagtttg, tgttattcac cactgtattc tcaaatatca gaagagtatc tggcctacta 4 920 
agtgcacaat aaacatagtt aaaatg 4946 

<210> 2 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> AB032969 



<400> 2 

taatcctgaa gccattctgc aatactgtct ttaatgtata ctcacttgtt atagaagcca 



60 
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<210> 3 

<211> 1007 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF005487 
<400> 3 

gaatacagaa tgtgggcaaa ctcgcttctg tgccggccgc cagaaggttt gctgagggca 60 

atcactccct ggtgccgggc tccttgaggt tatgcactgg gacatctaga gcctattgtt 120 

tgaggaatgc agtcttgcaa gcctgctctg gatcaagcca cagactgaaa cacccccgaa 180 

gagcaagcac gtttcttgga gcaggctaag tgtgagtgtc atatcttcaa tgggatgaag 240 

cgggtgcagt acctgaacag atacatccat aaacgggagg agaacctgcg cttcgacagc 300 

aacgtggagg agttccaggc agttacggaa ctggggcggc ctgtcgcaga gaactggaac 360 

agccagaagg gcatcccgga ggagaagcgg gacaagatgg acgactactg cagatacaat 420 

tacggggttt tttgagagct tcacagtgca gccgcgagtc catcctaagg tgactgtgta 480 

tcctgcaaag acccagcccc tgcatcaccg caaccccctg gtcggctctg tgagtggttt 540 

ctatccaggc agcattaaag tcaggtggtt ccagaatggt caggaagaga aggctgcggt 600 

ggtctccata ggcctgatcc agaatggaga ttggaccttc cagaccctgg tgatgctgga 660 

aacagttcct cggagtggag aggtttacac ctgccaagtg gagcatocaa gcgtgacgag 720 

ccctctcaca gtggaatgga gtacacggac tgaatctgca cagagcaaga tgctgagtgg 780 

agtcgggggc tttgtgctgg gcctgctctt ccttgggaca gggctgttca tctacttcag 840 

gaatcagaaa ggacactctg gacttcagcc aacaggactc ctgcgctgga ctcctgagct 900 

gaagtgcaca tgaccacatt caaggaagaa ccttctgcca cagctttgca ggatgaaaag 960 
ctttcccact tggctcttat tcttccacaa gagctctctc aggacca 1007 

<210> 4 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF005487 
<400> 4 

tttgcaggat gaaaagcttt cccacttggc tcttattctt ccacaagagc tctctcagga 60 

<210> 5 

<211> 3200 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF026941 



caggaagggc catgaagatt aataaagatt tggactcagg gcaaatattt acttagtagc 60 

aataactcaa agaattactg ttgaataaat aagccaatta agcagccaat cacgtactat 120 

gcggatgcac acaaatgaaa ccctcacttc aacctgaaga cattcgcaca tgagttacgt 180 

agagggacct gcaggaagcg gtagagaaaa cataaggctt atgcgtttaa tttccacacc 240 

aatttcagga tctttgtcac tgacagcagc actaagactt gttaacttta tatagttaag 300 

aagaacaagg ctgagcgcga tgactcacgc ctgtaagcct agaactttgg gaggccaaag 360 

caggcagact gcttgagccc aggagttcca gaccagcctg ggcaacatgg caacacccca 420 

tctctacaaa aaaatacaag aatcagctgg gcgtggtgat gtgttcctgt aatctcagct 4 80 

actcgggagg cagaggcagg aggattgctt gaacccggga ggcagaggtt gtagttagcc 540 

gagatctcgc cactgcactc cagtctggac gacagagtga gactcagtct caaataaata 600 

aataaataca taaatataag gaaaaaaata aagctgcttt ctcctcttcc tcctctttgg 660 

tctcatctgg ctctgctcca ggcatctgcc acaatgtggg tgcttacacc tgctgctttt 720 

gctgggaagt tcttgagtgt gttcaggcaa cctctgagct ctctgtggag gagcctggtc 780 



<400> 5 



3 




WO 2006/084272 PCT/US2006/004280 



ccgctgttct gctggctgag ggcaaccttc tggctgctag ctaccaagag gagaaagcag 840 

cagctggtcc tgagagggcc agatgagacc aaagaggagg aagaggaccc tcctctgccc 900 

accaccccaa ccagcgtcaa ctatcacttc actcgccagt gcaactacaa atgcggcttc 960 

tgtttccaca cagccaaaac atcctttgtg ctgccccttg aggaagcaaa gagaggattg 1020 

cttttgctta aggaagctgg tatggagaag atcaactttt caggtggaga gccatttctt 1080 

caagaccggg gagaatacct gggcaagttg gtgaggttct gcaaagtaga gttgcggctg 1140 

cccagcgtga gcatcgtgag caatggaagc ctgatccggg agaggtggtt ccagaattat 1200 

ggtgagtatt tggacattct cgctatctcc tgtgacagct ttgacgagga agtcaatgtc 1260 

cttattggcc gtggccaagg aaagaagaac catgtggaaa accttcaaaa gctgaggagg 1320 

tggtgtaggg attatagaat ccctttcaag ataaattctg tcattaatcg tttcaacgtg 1380 

gaagaggaca tgacggaaca gatcaaagca ctaaaccctg tccgctggaa agtgttccag 1440 

tgcctcttaa ttgaaggtga gaattgtgga gaagatgctc taagagaagc agaaagattt 1500 

gttattggtg atgaagaatt tgaaagattc ttggagcgcc acaaagaagt gtcctgcttg 1560 

gtgcctgaat ctaaccagaa gatgaaagac tcctacctta ttctggatga atatatgcgc 1620 

tttctgaact gtagaaaggg acggaaggac ccttccaagt ccatcctgga tgttggtgta 1680 

gaagaagcta taaaattcag tggatttgat gaaaagatgt ttctgaagcg aggaggaaaa 1740 

tacatatgga gtaaggctga tctgaagctg gattggtaga gcggaaagtg gaacgagact 1800 

tcaacacacc agtgggaaaa ctcctagagt aactgccatt gtctgcaata ctatcccgtt 1860 

ggtatttccc agtggctgaa aacctgattt tctgctgcac gtggcatctg attacctgtg 1920 

gtcactgaac acacgaataa cttggatagc aaatcctgag acaatggaaa accattaact 1980 

ttacttcatt ggcttataac cttgttgtta ttgaaacagc acttctgttt ttgagtttgt 2040 

tttagctaaa aagaaggaat acacacagga ataatgaccc caaaaatgct tagataaggc 2100 

ccctatacac aggacctgac atttagctca atgatgcgtt tgtaagaaat aagctctagt 2160 

gatatctgtg ggggcaatat ttaatttgga tttgattttt taaaacaatg tttactgcga 2220 

tttctatatt tccattttga aactatttct tgttccaggt ttgttcattt gacagagtca 2280 

gtattttttg ccaaatatcc agataaccag ttttcacatc tgagacatta caaagtatct 2340 

gcctcaatta tttctgctgg ttataatgct tttttttttt tttgctttta tgccattgca 2400 

gtcttgtact ttttactgtg atgtacagaa atagtcaaca gatgtttcca agaacatatg 24 60 

atatgataat cctaccaatt ttcaagaagt ctctagaaag agataacaca tggaaagacg 2520 

gcgtggtgca gcccagccca cggtgcctgt tccatgaatg ctggctacct atgtgtgtgg 2580 

tacctgttgt gtccctttct cttcaaagat ccctgagcaa aacaaagata cgctttccat 2640 

ttgatgatgg agttgacatg gaggcagtgc ttgcattgct ttgttcgcct atcatctggc 2700 

cacatgaggc tgtcaagcaa aagaatagga gtgtagttga gtagctggtt ggccctacat 2760 

ttctgagaag tgacgttaca ctgggttggc ataagatatc ctaaaatcac gctggaacot 2820 

tgggcaagga agaatgtgag caagagtaga gagagtgcct ggatttcatg tcagtgaagc 2880 

catgtcacca tatcatattt ttgaatgaac tctgagtcag ttgaaatagg gtaccatcta 2940 

ggtcagttta agaagagtca gctcagagaa agcaagcata agggaaaatg tcacgtaaac 3000 

tagatcaggg aacaaaatcc tctccttgtg gaaatatccc atgcagtttg ttgatacaac 3060 

ttagtatctt attgcctaaa aaaaaatttc tta.tcattgt ttcaaaaaag caaaatcatg 3120 

gaaaattttt gttgtccagg caaataaaag gtcattttaa tttaaaaaaa aaaaaaaaaa 3180 
aaaaaaaaaa aaaaaggcca 3200 

<210> 6 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF026941 
<400> 6 

atttttgaat gaactctgag tcagttgaaa tagggtacca tctaggtcag tttaagaaga 60 

<210> 7 

<211> 1799 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF035284 
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<400> 7 

gcttgaaccg gggaggtgga ggttgcagtg agctgagatc acgccattgt' actccagcct 60 

gggcgacaga gcaagactcc atttcaaaaa aaaaaaaaaa aaaaaaaatc cactcatata 120 

aaaggtgagc tcagctcact ggtccatttc tcagtggctt ctccatcctc atttgcaaac 180 

ctcagaggga taaggcagtt gaacctgatg agcaagaatt ataacagcaa ggaaacatta 240 

atgcttagaa ttctgagatc cagcacaact cagtctgtgg gagctcagct cgctgcccag 300 

ggataggtat gacctatgtc tgccttaggc tgctgggaga tgccattctc cagtttcaga 3 60 

agcaggcagg gcaaaggtca agactgtggt attggggtct tttggctctg aaggatcctg 4 20 

gaaccactga ttttggttta ttccctccag ggtctaaaga gaacaagagg tgctagctct 4 80 

taccaaaaca gatggtagag agagttgctg gctatttaaa aagctctttc atcttttaat 540 

tcacctcttc ttttcacctc tttaaccact cctcaggaac agaacacttc taggactggg 600 

ggtcttttag ctccataagc aagtgagcag atgggacaag ttagtctttt ctccctagaa 660 

acaaagggga tgcccagtgg tttccctttg cttcccaacc taaaatttca agtttaataa 720 

aatagcaatt agcagaagtg accaaattgg gagataatta tcagtcatga ggaaagacac 780 

agatttcggt cataaagaat gtaagggcta taagtagaaa ctttctataa cctaaatgat 840 

gttatagaat tatttttgag caggagcaga aagattaaat atgatcactt catacttcta 900 

aatcagaaat aggaagatta aaaccacaga acagtttgtg atttctattg ctggtagcta 960 

ggtatcttac tctgtccact cttgttcaag tatctaactc ttctggaaac caaataggct 1020 

ttagaagaga ttatcctata ttcctatcag tataatacta aaatgtaact ttttaatcat 1080 

ctggttttta aaagataaac agtttagccc atctctccag agagcaaaca taggaatatg 1140 

actcaggagc ctcctagggc ttatcatcag ccctcacacc cgcttccccc tccaacccac 1200 

agcctttgct tccaggtggc aggattacta ctttgcctct tcagcagcat ctactctagg 1260 

catattgatc attttagaca ctgggagaag agaacctcaa actaggagga aaagacagag 1320 

cctccactta gttttgggag gggatggcag acagtcaagg agatgagcgt cctaaggcat 1380 

gttgggatag ggtcagatgc accacccatg gagaggtttg tcaacacaaa gacatggaag 1440 

gttagaggtt tgtcaacaaa aagacatgga aggttaggtt tgtcaacaca aagacatgga 1500 

agattagagg tttgtcaaca caaagataca ggaagaatgg gctgcagaag atttagatgt 1560 

tttccatttg ggcacatttt acttagctgg agaactaggt ttaaaacagc ctgggtagga 1620 

aaattagaag caagctggat gcagtggctc atgcctgtaa tcccaacact tttgggaggt 1680 

ccaggcagga ggatcacttg ggcccaggag gtcaagcctg cagcgagctg agatcacacc 174 0 

actgcactcc agcctggggt gatagaacaa gaccctgtct caaaaaaaaa aaaaaaaaa 17 99 

<210> 8 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> AF035284 
<400> 8 

caaaaagaca tggaaggtta ggtttgtcaa cacaaagaca tggaagatta gaggtttgtc 60 

<210> 9 

<211> 1380 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF052162 
<400> 9 

gtcaaaggat atttatttat aggccttttt ttttttaata tagaatctga ggctgtttgg 60 

gctttgactt aaatttccat caggcctctc tccagcaggt aatccctctc cttccgctgg 120 

gtcccctggg gaggtgtgaa ctcaagggcc tagccccaaa acactttttc tgcttttctt 180 

aatccttttc cagtcccctc tttttttata aacgttggca gtttgatgtt tctgtttcgg 240 

cataacgtaa tccatttcac tgtagcctaa actccagtcc gaggttggat attgttcaaa 300 

tgagcagggc ccgagctgga agcgcaaggc agccgccgcc gtgccgctcc tcccttgccc 360 

tcaggccagg tccctgctgg aagcggctgc atcttcctgt cagccctggt ttccatggtg 4 20 

actggcgtca cgcagccacc cgagtatggc tgaccttcct gcagagagag gagccgcagt 4 80 

cttttgcttg tggaaggaga cgctgggctg tgcggtgcgg agggtgatga ggatgtctgg 54 0 



5 



WO 2006/084272 PCT/LS2006/004280 



tgacagccgt gcggacacca ctcctctctg cagcactgcc tcccagcgcc agggtcgcgg 600 

gcacatccca ctgagagcgg gggtcctgcc ccatcttaga gtcaaaggca gaggggcttc 660 

caggccctgg atggggtatt ttggtgtcac ctgaagtccc tctgacatca ccttgtttca 720 

tcatttttta tgacagaatt agaaacccat ccttcaagca caataatcat cacagacttg 780 

agtttgcttc ctaaagcaaa ggctccgggt ttgtttggaa aatttttttg atttctgaaa 840 

tgaattgatt tttatatttg gggcatctct atagaaagtg accaccaagg ccagtaagta 900 

cgggaaaaaa tgtttactaa cttcctcaga gattcgtgat acgcgtttct ccactgacag 960 

acatttaaaa acaaccttca gctccgtttc aatcaatcac ctcgacttgt tttttagcat 1020 

ggacactgcc agcaggacag acagggatgg agtaaaccga agtcaatttc agggctcttg 1080 

gcgtgttgga cacagaagaa atcctagtgc agcctttggt agctaacagt cactgatttt 1140 

ataattggag aatgcgtaaa gattcatttt tcaaggagaa gagcctgcaa atggccaatg 1200 

aaggaggtaa ataaactaag atattccgag ggaagggacc caggccacct cccttccgca 1260 

ggtctgcaga tgaagggttt tttgaatgaa atgccactgt gcattttcag aaaaaaaaat 1320 

ctctgataaa cagactttga atggaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1380 

<210> 10 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> AF052162 
<400> 10 

cagtaagtac gggaaaaaat gtttactaac ttcctcagag attcgtgata cgcgtttctc 60 

<210> 11 

<211> 1722 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF055033 
<400> 11 

ggggaaaaga gctaggaaag agctgcaaag cagtgtgggc tttttccctt tttttgctcc 60 

ttttcattac ccctcctccg ttttcaccct tctccggact tcgcgtagaa cctgcgaatt 120 

tcgaagagga ggtggcaaag tgggagaaaa gaggtgttag ggtttggggt ttttttgttt 180 

ttgtttttgt tttttaattt cttgatttca acattttctc ccaccctctc ggctgcagcc 240 

aacgcctctt acctgttctg cggcgccgcg caccgctggc agctgagggt tagaaagcgg 300 

ggtgtatttt agattttaag caaaaatttt aaagataaat ccatttttct ctcccacccc 360 

caacgccatc tccactgcat ccgatctcat tatttcggtg gttgcttggg ggtgaacaat 420 

tttgtggctt tttttcccct ataattctga cccgctcagg cttgagggtt tctccggcct 480 

ccgctcactg cgtgcacctg gcgctgccct gcttccccca acctgttgca aggctttaat 540 

tcttgcaact gggacctgct cgcaggcacc ccagccctcc acctctctct acatttttgc 600 

aagtgtctgg gggagggcac ctgctctacc tgccagaaat tttaaaacaa aaacaaaaac 660 

aaaaaaatct ccgggggccc tcttggcccc tttatccctg cactctcgct ctcctgcccc 720 

accccgaggt aaagggggcg actaagagaa gatggtgttg ctcaccgcgg tcctcctgct 780 

gctggccgcc tatgcggggc cggcccagag cctgggctcc ttcgtgcact gcgagccctg 840 

cgacgagaaa gccctctcca tgtgcccccc cagccccctg ggctgcgagc tggtcaagga 900 

gccgggctgc ggctgctgca tgacctgcgc cctggccgag gggcagtcgt gcggcgtcta 960 

caccgagcgc tgcgcccagg ggctgcgctg cctcccccgg caggacgagg agaagccgct 1020 

gcacgccctg ctgcacggcc gcggggtttg cctcaacgaa aagagctacc gcgagcaagt 1080 

caagatcgag agagactccc gtgagcacga ggagcccacc acctctgaga tggccgagga 1140 

gacctactcc cccaagatct tccggcccaa acacacccgc atctccgagc tgaaggctga 1200 

agcagtgaag aaggaccgca gaaagaagct gacccagtcc aagtttgtcg ggggagccga 1260 

gaacactgcc cacccccgga tcatotctgc acctgagatg agacaggagt ctgagcaggg 1320 

cccctgccgc agacacatgg aggcttccct gcaggagctc aaagccagcc cacgcatggt 1380 

gccccgtgct gtgtacctgc .ccaattgtga ccgcaaagga ttctacaaga gaaagcagtg 1440 

caaaccttcc cgtggccgca agcgtggcat ctgctggtgc gtggacaagt acgggatgaa 1500 
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gctgccaggc atggagtacg ttgacgggga ctttcagtgc cacaccttcg acagcagcaa 1560 

cgttgagtga tgcgtccccc cccaaccttt ccctcacccc ctcccacccc cagccccgac 1620 
tccagccagc gcctccctcc accccaggac gccactcatt tcatctcatt taagggaaaa 1680 
atatatatct atctatttga ggaaaaaaaa aaaaaaaaaa aa 1722 

<210> 12 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<400> 12 

tccaccccag gacgccactc atttcatctc atttaaggga aaaatatata tctatctatt 

<210> 13 

<211> 1411 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AK001166 

<400> 13 
aaacaaagag 
attttcccgt 
tagattggta 
gcagacctct 
agctgatatg 
atttgaatat 
cataacagat 
atcctatcct 
tcctgaaaaa 
accatttcaa 
gagaccactg 
aaagaaaagc 
aatgtttggc 
taaaatttat 
ttaagggtat 
atacatccta 
atagcgtgta 
agtagccaat 
ttaatgtctt 
gattttacta 
gagtagattt 
gaactggata 
ttaaataaaa 
aatgtaaata 



atgccacccc 
tgcatcttgt 
acgtttctga 
atagaggagc 
gatatcactt 
caaagatcat 
gccaaactct 
gaagtctatc 
cccaaaccga 
agaactagaa 
ttgttgtttt 
ataaaacaaa 
cattagtatt 
gagtcaaata 
ttttatctct 
gctactgata 
tacttgaata 
tatcatttct 
atattgtgat 
aagctagtgt 
tcatctgcct 
aattctttgg 
tgagactact 
aacttatttt 



tgtgtgatgg 
gttccaagga 
tggacaatta 
gtgtggctca 
tatctgctcc 
atggctctca 
ccaacaaaga 
aagaacgatt 
aaccacagct 
gttttcgaat 
gagtgaatgg 
gactactgaa 
tttataaaac 
tatagtgata 
gtgctttctt 
tctttattat 
agaaagacac 
cctgtataaa 
agctatgtct 
ttttataaca 
gcaatggtat 
gtacccttag 
tcctttataa 
ttttaaaaat 



ctttggtacc 
tgaagtggac 
ccaggaaatt 
tctacgaaga 
atcattttgc 
ggaacctctg 
gaaaaagaag 
tcctacacca 
gctaatgtgg 
gtaataatac 
tggttaggag 
atatagataa 
tcaatgctag 
atgttaacat 
tttcatggtg 
agccttaaga 
tgggtactgt 
ttccagtttt 
tttattgcag 
tatatattag 
aatttcagtc 
acctctgatt 
atatattttc 
g 1411 



cgaacactga 
ttggatgagt 
ctgaaagtcc 
gtccagataa 
cgtcaaatta 
gcagccttgt 
aaactgaagc 
gaaagtgcag 
gcactaaaga 
ttccacagca 
aaagactttg 
agattgcctt 
ttttaagtgt 
gtttgtaatt 
tttattaaat 
cttaatttta 
tactgtgatg 
tattgctgca 
atttattgga 
ttgatgttta 
ttagctaaaa 
ctaagtcaaa 
atccttttga 



tggttcagac 
tattagctgc 
ctttggcctt 
aatacccagg 
gtccagagga 
tggaggaagt 
agtttcagaa 
cacttctgtt 
agcctttcca 
acaggtgcta 
gtggtggaag 
agtttttaaa 
ataaattggt 
gctacagaat 
aattgtgtat 
agtcttaaaa 
ctattgactt 
cataaatttt 
tgttatgaca 
cctataagtg 
atggaaagtt 
tgcaaatggg 
aagtaagtga 



<210> 14 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AK001166 
<400> 14 

acccttagac ctctgattct aagtcaaatg caaatgggtt aaataaaatg agactacttc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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<210> 15 

<211> 2352 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AL049367 
<400> 15 

ggcaaacccc ttttaaaatc taatgtctgg gctttgagta ttagctcatt tagggtggac 60 

aaatgcatta ctgttttcaa actgctcaca tttattcagt atttctccaa gttgctatct 120 

actcagcctt atgaatgccc ctcgcttttc taaggccatg tgaaaatcac ggcactgccc 180 

ttagccttgt gtcatctgct ttttcgttct gcgatatgcc cagttcccaa atcaattata 240 

ggtacctgtt taggagagag gaagatttta cctctcaaag ggtgagattt gaaatttaca 300 

ctaaaaagac aactttacat ttaatgcttc acttaatgag acattctttt ttttataagt 360 

ctatttttct actcagtttc agaacactaa tctgattttc actctgattt ttaacgtttc 420 

tttaaatatt tataatgtag cttctttcaa aatattttca tgaaaaatta cttttattat 480 

accattatgt gcatgttatt ggtagcaggc atagtttatt atttagtact gaaacatgct 540 

cttttaccta acagtaaaca' agtatgtttt gatatatatc tgttaatatg cttatagtgg 600 

taagaaatgg acttgaggtc ccaggagatt tcattttatt caccctggtc agatacaata 660 

aaggctatga gtataaatac ataacttcct aaccaggtgt agggcatgtt catgaatatc 720 

aaatcttttg atgctggacc caagagagga aaagttgtag ctaaatgttg atttacttat 780 

aactagacgt ctatgtgaga aaatatatgt atacatatat atgatatgca gaagtcactt 840 

tttttatcag gctttattct ccttacaaag ccacagttta actgtctgca acagttggtt 900 

tatgttaatg atagacaaat acccagtgtt tgttactttt tccaactacc actgtaatga 960 

taatctttct cacgtatata catgcaactt cttggcttca tttccatgaa gctgtttcaa 1020 

tatattcagt atactttgtc cttaatgctg cttctgttaa cagtgatctc tttctttttt 1080 

tcattcttat atcttcatta gttcatcata aatctgtcca gttgaggcct caggaccacg 1140 

gcatgatttc atgactccga agtattttac agaaacattt tttaaataag ggaaatattt 1200 

tatataccag atggttcaca agtgatggct catagctagt tttttttttt tcttctaaaa 1260 

aatgtcaggt ttttaaaatc atttacctta ttaaaatgaa aagtgccata cttaactttt 1320 

aaaggaaaga cctgacttgc tttttctcta tttagactgt ttttgtactt tactaatctt 1380 

taaactatca ggaaaaaaac caaaacttta taccaatgat ttagtaattt tgaggcatag 1440 

ggtagcttac, gtagtggagg atgtgccaaa tattctcttc aaatgccacc ttctcaattt 1500 

ataactaaaa tagtgttatc tgactaattc ctctgaattt tgatgtaaga tctatatagg 1560 

cccccaaaat gatcgtagta catgccagtc atttctcagt gaaataaata caataccaga 1620 

gtacattatg ggttttattg ctttctttta tggtagacct gttaatgggg aaaaaataca 1680 

tcaaatcaaa tagaatctta tatctgtatg • ttaaaataga gcacttacct gaagtcagtg 1740 

gcctggatca tagccctgga tcatttccca gtctgtcctg tgctgtgtga ccttggacaa 1800 

ggcgcttcat ctctctgggc ctctatttct ccatttgtaa aacaagtggc tgcagtagat 1860 

gatggctgag agcccttcct gttcccagat gccttggtcc aaagacccca cccctctgct 1920 

ggtcctgcca acgtgttggt gctataagct gcttcagata taaaattggt ttatctataa 1980 

tgtttgttca tttaatagct tctaaaaggc ctttttgtta tacagtgctt tttttctagt 2040 

tttatggact tgattactgt aataatgtct tgtttttagc catgtaacta caaacagata 2100 

ttctcttgat gtcttagtaa atttgcattt gatatatcat tgatgagatt ttgttgttat 2160 

gtaatattct ttggctacgc atctgtccag catcttatta accataatac tgtgatcatt 2220 

atttggaaat atgtcctatg gaaagaataa aagcatgtac ttcacagcta gcatgttcac 2280 

agatttgaaa gaagtttcat taaaagcacc attgctttct gtaaaaaaaa aaaaaaaaaa 234 0 
aaaaaaaaaa aa 2352 

<210> 16 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AL049367 
<400> 16 

atttggaaat atgtcctatg gaaagaataa aagcatgtac ttcacagcta gcatgttcac 60 
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<210> 17 

<211> 1130 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AL080235 



<400> 17 

ggtcgccgca ccggccgcct ccggcccgcc 
ggggcgccca ccgcgctgcc agcctacccc 
cagggcgagc cgctgcattt ctgctgccta 
ccgggctggc ggctgaaccg taagcccatt 
ctggtcatcg tggtgtggag cgtggccgcc 
ttcctgccca acggcatgga acagcgccgg 
gccgcagtgc ccgcagggac caccgcagcc 
gcggccgtca cttcgggggt ggcgaccaag 
ctgtgtccgc gcgcgcgggt gcctttcccg 
tagttatcgt tagttcctct tcccgagatg 
aagcaaggtt tgtgctgcgc ttccagttcc 
gggtgggatc gcagctccga agacggagag 
catccccctg cccgactcct tccccgcacc 
ccaaatcctg tgtatttgtt ttatatattt 
aaaaaataca aaacaaaaag attaaattgc 
ctgaacatag ttgtatttga aatttgtggt 
catgggaagg atttaacacc gatatattgt 
tccaagttga tctatccagt gacgtggcct 
tttttggctt ttaatacaga cattttcctc 

<210> 18 
<211> 60 
<212> DNA 



gccgccccca gcgccgccgc cgccaccgcc 60 

gcggccgagc cgcccgggcc gctgtggctg 120 

gacttcagcc tggaggagct gcagggcgag 180 

gagtccacgc tggtggcctg cttcatgacc 240 

ctcatctggc cggtgcccat catcgccggc 300 

accaccgcca gcaccaccgc agccaccccc 360 

gccgccgccg ccgccgctgc cgccgccgcc 420 

tgacccgctc cgctcctccc tgtgtccgtc 480 

ccggggactc ggccggtgtg cttcgtgctg 540 

gggccgccga gagaccccag cgcctttgaa 600 

gaaaagcaga tgtttaagcc cttggactga 660 

gagggaaatg gggccctttc ccctctattg 720 . 

cacgtgccct agattcatgg cagaaaatga 780 

aataactgtt ttaaatgaaa gttttagtaa 840 

tattgctgta gtaagagaag ctctttgtat 900 

tttttaattt atttaaaatt ggggggaggg 960 

taccgctgaa aatgaacttt atgaaccttt 1020 

ggtgggcgtt tcttcttgta cttatgtggt 1080 
caaaaaaaaa aaaaaaaagg 1130 



<213> Homo sapiens 
<300> 

<308> AL080235 



<400> 18 

ctttgaaaag caaggtttgt gctgcgcttc cagttccgaa aagcagatgt ttaagccctt 60 

<210> 19 
<211> 2498 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> AL137540 
<400> 19 

gctgaaacga cagtcttgtc cctgtcagag aaatgacctg aacgaagagc 
tacacactat gcaatctatg atttcattgt caagggcagc tgcttctgca 
tgatcaatgc atacctgttc atggcttcag acctgtcaag gccccaggaa 
ggtccatggg aagtgtatgt gtaagcacaa cacagcaggc agccactgcc 
cccgttatac aatgaccggc catgggaggc agctgatggc aaaacggggg 
gtgcagaacc tgcaagtgta atgggcatgc tgatacctgt cacttcgacg 
ggaggcatca gggaatcgta gtggtggtgt ctgtgatgac tgtcagcaca 
acagtattgc cagaggtgca agccaggctt ctatcgtgac ctgcggagac 
tccagatgct tgcaaaccgt gttcctgcca tccagtagga tcagctgtcc 
ctcagtgacc ttctgcgacc ccagcaatgg tgactgccct tgcaagcctg 



ctcaacattt 60 

atggccacgc 120 

cattccacat 180 

agcactgtgc 240 

ctcccaacga 300 

ttaatgtgtg 360 

acacagaagg 420 

ccttctcagc 480 

ttcctgccaa 540 

gggtggcagg 600 



9 



WO 2006/084272 



PCT/LS2006/004280 



gcgacgttgt gacaggtgca tggtgggata 
atgtgactgt gcggggagct gtgaccctat 
catagactgg tatcatgaag ttcctgactt 
ctgggagtgg gaggatgcgc aggggttttc 
taaggaacag acattaggaa atgccaaggc 
aaaaataaag attttatcag ctcatgataa 
taaaaaggtc ttaaaatcta ccaaactgaa 
agaatcatgg acggacagag gatgcacttg 
tgtagcagga catgaggata taagaacagg 
ccagcactgg aaaccttctc ttggaagaaa 
gtagcattaa gatggatagc acataatggc 
agcaagaaga cctcagacag gaaactggaa 
tttgaatgca tgggtcttat ctaacttatc 
atttcatgaa gagaaatgaa aacccctaca 
gaaattctta actattaaga atattttaat 
agggcagtac ctctaacaag gacgccttcc 
tacttaaagt aatgaatgac gttttaagga 
tgttaaagag ccttctcttg tgtgttacgc 
acctctccat gtgtatatag tatttccttg 
ttgtgaacgt ttggtgactg ctgttgaaag 
aagcagcagc actgccacta catgtggaca 
ttaactctga atacttggag aaactaggtg 
cgtgaagtct cagctttgag ctggaggcta 
tttaaaaaac taaaccagaa gagactttaa 
tatgcttggc taaaggggaa atggacttta 
gtatacactt caaaaattat tgtagacaca 
aaacctgaac attgaaacag ttttcctcct 
ttacctgttt tttaacacaa tgtatgtgat 
tcatcttctg tacccacgca taaccactat 
aacatgaaca cagtgccata tgaataattt 
ctgtggactt gcaaatatat atatatattg 
tgtaataaag gaatatgatc tgaaaaaaaa 

<210> 20 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AL137540 



ctggggcttc ggagactatg gctgtcgacc 660 

caccggagac tgcatcagca gccacacaga 720 

ccgtcccgtg cacaataaga gcgaaccagc 780 

tgcacttcta cactcaggta aatgcgaatg 840 

attctgtgga atgaaatatt catatgtgct 900 

aggtactcat gttgaggtca atgtgaagat 960 

gattttccga ggaaagcgaa cattatatcc 1020 

tccaatcctc aatcctggtt tggaatacct 1080 

caaactaatt gtgaatatga aaagctttgt 1140 

agtcatggat attttaaaaa gagagtgcaa 1200 

acttgtctat gtacaaaaca caaactttag 1260 

ttttttaaag tgccaaaaca tatagaaatg 1320 

tcttctggac ccatgtttaa atacagtttt 1380 

ctgatatctg ttttctatgg gactgattct 1440 

agcagcatga catttagcag taatccatta 1500 

agcttcagcg atgttactta cgtttgatgc 1560 

atccctaacc ctactatcag aaaaggtgtt 1620 

atgaactttg gtctgtaggt gttaaatgga 1680 

tataaagcac tttactacct accacttgtg 1740 

aaggaaaagg gtgtgtgaga aagcctactg 1800 

aaagtgacca tataaaagaa gttgtgctat 1860 

aagatgcaac cagaaaggag aatatgtatg 1920 

gattccaaga tgacagccat gatgaaactt 1980 

aataagagaa agaaatcata aatgtagaca 2040 

aattttaaag agctcatttg caatgcactt 2100 

gaatttgtta tatttttgtg cttagtattt 2160 

tgtctttctt aacagtaata gtcattatat 2220 

agtcaaaaaa tcacagtttt tcattattat 2280 

acatagtttc ttttgtactt gaatatacaa 2340 

cacatacaga accttttttt ctctgaagtc 2400 

ctttgttaat ttgtttttat atttcatata 2460 
aaaaaaaa 24 98 



<400> 20 

tggaggctag attccaagat gacagccatg atgaaacttt ttaaaaaact aaaccagaag 60 

<210> 21 
<211> 914 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> AL160131 
<400> 21 

cgcaccgcag gagcaacggt 
aagctgcccg gcctgaacac 
cagcagctgg cggactcgat 
gcaaagtccc tccctttgcc 
gtggttaatc ttcacagcaa 
gatgccagct tcttcttggg 
cactgcagca ttcaccggca 



tggtcctgcg gctgtgatgt cggtgttgag gcccctggac 60 

ggccaccatc ttgctggtgg gcacggagga tgctcttctg 120 

gctcaaagag gactgcgcct ccgagctgaa ggtccacttg 180 

ctccagtgtg aatcggcccc gaattgacct gatcgtgttt 240 

atacagtctc cagaacacag aggagtccct gcgccatgtg 300 

gaaggtgtgt ttcctcgcca caggtgctgg gcgggagagc 360 

caccgtggtg aagctggccc acacctatca aagccccctg 420 
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ctctactgtg 


acctggaggt 


ggaaggcttt 


agggccacca 


tggcgcagcg 


cctggtgcgc 


480 


gtgctgcaga 


tctgtgctgg 


ccacgtgccc 


ggtgtctcag 


ctctgaacct 


gctgtccctg 


540 


ct 9 a 9 a agct 


ctgagggccc 


ctccctggag 


gacctgtgag 


ggtggctggc 


ccctgggctg 


600 


ccccttctca 


tggcttcgtg 


ctgactccat 


aaacattctc 


tgttgaggat 


gtccagtcag 


660 




gcccaggctc 


agcccgccgt 


ggctgggaag 


gttccctgca 


gtgccagtgc 


720 


tgcagcaggg 


agagctgggc 


agaagcagcg 


agggggecca 


getggegaga 


ctgtagcccc 


780 


ctcccactcc 


cacactcact 


cttgcagagc 


ctgtgtcttt 


aagcagctgg 


cgtgttacat 


840 


ctccatttaa 


ggtttccttt 


gaacaaaagg 


tctgtggcta 


aaaaaagttt 


aaaaatcact 


900 


ggtctcattc 


acca 914 












<210> 22 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> AL160131 












<400> 22 














agctggcgtg ttacatctcc 


atttaaggtt 


tcctttgaac 


aaaaggtctg 


tggctaaaaa 


60 


<210> 23 














<211> 4753 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> D13642 












<400> 23 














cttcaatcaa 


gtagccttcc 


ac gcagea 


cacacccagg 


aaatttgtca 


tccaccctga 


60 


gagtaacaac 


cttattatca 


ttgaaacgga 


ccacaatgcc 


tacactgagg 


ccacgaaagc 


120 


tcagagaaag 


cagcagatgg 


cagaggaaat 


ggtggaagca 


gcaggggagg 


atgagcggga 


180 


gctggccgca 


gagatggcag 


cagcattcct 


caatgaaaac 


ctccctgaat 


ccatctttgg 


240 


agctcccaag 


gctggcaatg 


ggcagtgggc 


ctctgtgatc 


cgagtgatga 


atcccattca 


300 


agggaacaca 


ctggaccttg 


tccagctgga 


acagaatgag 


gcagctttta 


gtgtggctgt 


360 


gtgcaggttt 


tccaacactg 


gtgaagactg 


gtafcgtgctg 


gtgggtgtgg 


ccaaggacct 


420 


gatactaaac 


ccccgatctg 


tggcaggggg 


ettegtctat 


acttacaagc 


ttgtgaacaa 


480 


tggggaaaaa 


ctggagtttt 


tgcacaagac 


tcctgtggaa 


gaggtccctg 


ctgctattgc 


540 


cccattccag 


gggagggtgt 


tgattggtgt 


ggggaagctg 


ttgcgtgtct 


atgacctggg 


600 


aaagaagaag 


ttactccgaa 




taagcatatt 


gecaattata 


tctctgggat 


660 


ccagactatt 


ggacataggg 


aa 9 a c 


tgatgt ccaa 


gaaagtttca 


tctgggttcg 


720 


ctacaagcgt 


aatgaaaacc 


age a ca 


etttgetgat 


gatacctacc 


cccgatgggt 


780 


cactacagcc 


agcctcctgg 


ac a gacac 


tgtggctggg 


gcagacaagt 


ttggcaacat 


840 


atgtgtggtg 


aggctcccac 


ctaacaccaa 


tgatgaagta 


gatgaggatc 


ctacaggaaa 


900 


caaagccctg 


tgggaccgtg 


gettgetcaa 


tggggcctcc 


cagaaggcag 


aggtgatcat 


960 


gaactaccat 


gtcggggaga 


cggtgctgtc 


ettgeagaag 


accacgctga 


tccctggagg 


1020 


ctcagaatca 


cttgtctata 


ccacc g c 


^ggaggaa 


ggcatccttg 


tgccattcac 


1080 


gtcccatgag 


gaccatgact 




g ggaaa g 


cacctgcggt 


ctgaacatcc 


1140 


ccctctctgt 


gggcgggacc 


tt 

cc cage 




tacttccctg 


tgaagaatgt 


1200 


gattgatgga 


gacctctgtg 






cccaacaaac 


aaaagaacgt 


1260 


ctctgaagaa 


ctggaccgaa 


ccccacccga 


agtgtccaag 




atatceggae 




ccgctacgcc 


ttctgagccc 


tcctttcccg 


gtggggcttg 


ccagagactg 


tgtgttttgt 


1380 


ttcccccacc 


accatcactg 


ccacctggct 


tetgecatgt 


ggcaggaggg 


tgactggata 


1440 


attaagactg 


cattatgaaa 


gtcaacagct 


ctttcccctc 


agctcttctc 


ctggaatgac 


1500 


tggcttcccc 


tcaaattggc 


actgagattt 


gctacacttc 


tccccacctg 


gtacatgata 


1560 


catgacccca 


ggttccagtg 


tagaacctga 


gtcccccatt 


ccccaaagcc 


atccctgcat 


1620 


tgatatgtct 


tgactctcct 


gtctactttt 


gcacacaccc 


ttaattttta 


attggttttc 


1680 


ttgtaaatac 


agttttgtac 


aatgttatct- 


ctgtgggagg 


aaggaggcag 


gctgtggtgg 


1740 


gactgggtag 


ggtatagtat 


cactcctgag 


ttccactgct 


ctagaatcta 


accagaaata 


1800 


gaaacctagt 


ttttaaggtg 


actggcatcc 


atgtgtcttg 


ttctggagat 


gaggatgtag 


1860 
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gtgggaggtt 
accgacttgg 
gttttttcct 
cccacccctc 
ctgaggatgt 
cccttttact 
agtacagaag 
tgcagcaccc 
gtggaagagg 
ttgtgtggga 
tcactgttct 
ctaaattgag 
ctgaacctct 
taagtatctc 
aggaatcgaa 
gacagtagtt 
cctaaatcca 
tagatacttg 
tccgtacaga 
tgtcaccacc 
gaagggctag 
tgcaggacct 
tgtaattgtt 
agacctctct 
atctgccctt 
aggactgaaa 
agatgtttcc 
tagttggaat 
cgctggaaaa 
cgctgctgcc 
tctggctttc 
aaattttatt 
agctcactga 
agactacaga 
tccctgtgtt 
ctcccagcgc 
gcctgggcaa 
ggtgcacact 
gagtttgagg 
gaaacctgtc 
aacgcacatg 
cctgggagat 
ttccaagcca 



tgaacccaag 
acttgcggca 
ggttcttttt 
ttcccactgg 
tagtatgggg 
gatagagcac 

atctcaggtg 
cccagttcca 
agccctcgtc 
caatacttca 
caaatgatct 
cagagcccag 
ttgaatcatg 
tccctaaata 
ctgaattaga 
ttctatcaaa 
agcagctcct 
aggtaggggg 
gagcttcctc 
gacaaggatt 
acaaaagttt 
aaggggacct 
ccagctttgg 
gggtttgctt 
agtttttagc 
tttggaagat 
gttgcttttc 
ggctcgtctc 
tttcttgagc 
aaattttttt 
agagacagtc 
ggcttgaact 
tgtgtgccac 
gcccaggctg 
tttgagaggc 
catgacaaaa 
tgtagtccca 
ctgoagtgag 
tcaaaaacac 
actgagtgct 
ctgggggacc 
agaggcccca 



ttagagcagg 
ctctgtggct 
gctgtgctga 
gtattgagga 
attttagcat 
ctaaagcgat 
ctgccctgcc 
ggttccagag 
gaaatgttcc 
ttccaggctg 
caaatgtaaa 
ggtgagcact 
aacagagggt 
ac'ttggtttt 
acatgttttt 
ttttgaaaac 
ttgtgtgata 
cgcctctttt 
ttatgtaaga 
tggaagtctg 
tgggcttgaa 
aaaattagat 
gattgactcc 
caaaacccat 
ccatcctgtt 
tgcctcagct 
ctcttgccaa 
ttggttagtg 
acagggagag 
ttgttagagg 
taattcctaa 
tctctctctt 
cctgggctcg 
catactcagc 
gcctcgaact 
tgaggcagga 
ccccatctct 
gtaattaggg 
ctgtgattgc 
caaaaaacaa 
caggggttct 
tgctgtccta 
gagagggcac 



aagaactgag 
ccccaccccc 
tgaaacatga 
agggtggctg 
gaattccagc 
ctttggctcc 
cctgagaact 
ggcctttagg 
agcccacccc 
tccttgcgcc 
actttctttc 
gggttagaat 
tcctgacact 
agatcagtca 
ttctcactta 
ctaatttcag 
ctgacatgca 
ctaactcaag 
aaggaaaacc 
cccatcagct 
tatgtggaaa 
tggatgtgac 
tgtggtttga 
cagaggctgc 
ccattgctaa 
ttcccctgac 
gaatagcatt 
taaatgtatt 
tgctggtccc 
aaagccagaa 
gttctgtttt 
gcctagctgg 
agcaatccac 
tagtttttaa 
cctgacctca 
ggatcccttg 
ccaaaaatac 
ggctgagaca 
gccactacac 
aaaccggtct 
gaggcttgtc 
tatgtgatgc 
cgtggggtgt 



tagactcctt 
aggtctgtgg 
cctcaataac 
attcttcctc 
tggggagtct 
ataggaccat 
gtgtatatgt 
gtataatgag 
tgagaattcc 
ttgaacctgg 
gtctgcatgt 
caggaatggt 
gtgacactgt 
agagagaccc 
gctcatgaat 
ggctcatttt 
gtcatctgag 
tttgactaaa 
taatctatgg 
tgcttgttct 
ggaattttca 
tcaatgacaa 
ttgagcaacc 
tgcagaactc 
gcccttgtga 
cttactggca 
cctttggagg 
gctagtgaga 
cagaatgtgt 
aggcattcag 
attttttaat 
gagtgcagtg 
ctcagcctcc 
actttcgtag 
aaaaatcttc 
agcccaggag 
aaaaattggc 
ggaggatcac 
tccagcctgg 
cctggggtca 
cgctgacctg 
tttgaaagaa 
tcaggcttct 



ccttccagat 
tggtttcttt 
catgtgtata 
ctcttctact 
taacagatgc 
aggaagggtc 
ggggcctgtc 
agcctgttag 
tcctgtttag 
agaagtgagc 
gctcagccat 
ggaatacaat 
ctcctggaac 
aggttttgcc 
ttgcatagta 
ttcctgtggc 
gaactcagcg 
atacatacac 
aatcaggagt 
ctgttaagag 
tagttgttgc 
gtcccatctg 
aggtaaatag 
agacagaggg 
cttggatcct 
gaggttctgc 
aggggggttc 
cagctgccgg 
gctgttccca 
atgggatcag 
tttttaaaaa 
gagtgatcat 
agagtagggg 
agacagggtc 
ctgccttggc 
tttgagacca 
caggcatggt 
ttcagcctat 
atgacaggac 
tggtagcaca 
gggctctggc 
aggggcatca 
gtgaggcccc 



agtgagatcc tgtggctgtg cccccatcac ctccacccac tctgccctcc cactagctgc 
ccaacggatg aatcaacgcc ttggcagagt tttccagcag ggccttgcag agagtgtgtg 
tgacctgtgt ggccactgcc ttggggacgg gtgaggagtt agcctggaac attccagcgt 
gggcattatt gtcctgttgc aagttcaggg caaaaccagg aatccagttt tgtcgatcca 
attgagaaaa catttcatga acaactactt gtggcatgca ttggcactcg gaataaagcg 
cactattgtc act 4753 

<210> 24 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> D13642 
<400> 24 

aaaccaggaa tccagttttg tcgatccaat tgagaaaaca tttcatgaac aactacttgt 



1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 

4500 
4560 
4620 
4680 
4740 
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<210> 25 

<211> 2591 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> D25328 
<400> 25 

cccggacgtg cggctcccct cggcctcctc gccatggacg cggacgactc ccgggccccc 60 

aagggctcct tgcggaagtt cctggagcac ctctccgggg ccggcaaggc catcggcgtg 120 

ctgaccagcg gcggggatgc tcaaggtatg aacgctgccg tccgtgccgt ggtgcgcatg 180 

ggtatctacg tgggggccaa ggtgtacttc atctacgagg gctaccaggg catggtggac 24 0 

ggaggctcaa acatcgcaga ggccgactgg gagagtgtct ccagcatcct gcaagtgggc 300 

gggacgatca ttggcagtgc gcggtgccag gccttccgca cgcgggaagg ccgcctgaag 360 

gctgcttgca acctgctgca gcgcggcatc accaacctgt gtgtgatcgg cggggacggg 420 

agcctcaccg gggccaacct cttccggaag gagtggagtg ggctgctgga ggagctggcc 480 

aggaacggcc agatcgataa ggaggccgtg cagaagtacg cctacctcaa cgtggtgggc 540 

atggtgggct ccatcgacaa tgatttctgc ggcaccgaca tgaccatcgg cacggactcc 600 

gccctgcaca ggatcatcga ggtcgtcgac gccatcatga ccacggccca gagccaccag 660 

aggaccttcg ttctggaggt gatgggacga cactgtgggt acctggccct ggtgagtgcc 720 

ttggcctgcg gtgcggactg ggtgttcctt ccagaatctc caccagagga aggctgggag 780 

gagcagatgt gtgtcaaact ctcggagaac cgtgcccgga aaaaaaggct gaatattatt 840 

attgtggctg aaggagcaat tgatacccaa aataaaccca tcacctctga gaaaatcaaa 900 

gagcttgtcg tcacgcagct gggctatgac acacgtgtga ccatcctcgg gcacgtgcag 960 

agaggaggga ccccttcggc attcgacagg atcttggcca gccgcatggg agtggaggca 1020 

gtcatcgcct tgctagaggc caccccggac accccagctt gcgtcgtgtc actgaacggg 1080 

aaccacgccg tgcgcctgcc gctgatggag tgcgtgcaga tgactcagga tgtgcagaag 114 0 

gcgatggacg agaggagatt toaagatgcg gttcgactcc gagggaggag ctttgcgggc 1200 

aacctgaaca cctacaagcg acttgccatc aagctgccgg atgatcagat cccaaagacc 1260 

aattgcaacg tagctgtcat caacgtgggg gcacccgcgg ctgggatgaa cgcggccgta 1320 

cgctcagctg tgcgcgtggg cattgccgac ggccacagga tgctcgccat ctatgatggc 1380 

tttgacggct tcgccaaggg ccagatcaaa gaaatcggct ggacagatgt cgggggctgg 1440 

accggccaag gaggctccat tcttgggaca aaacgcgttc tcccggggaa gtacttggaa 1500 

gagatcgcca cacagatgcg cacgcacagc atcaacgcgc tgctgatcat cggtggattc 1560 

gaggcctacc tgggactcct ggagctgtca gccgcccggg agaagcacga ggagttctgt 1620 

gtccccatgg tcatggttcc cgctactgtg tccaacaatg tgccgggttc cgatttcagc 1680 

atcggggcag acaccgccct gaacactatc accgacacct gcgaccgcat caagcagtcc 1740 

gccagcggaa ccaagcggcg cgtgttcatc atcgagacca tgggcggcta ctgtggctac 1800 

ctggccaaca tgggggggct cgcggccgga gctgatgccg catacatttt cgaagagccc 1860 

ttcgacatca gggatctgca gtccaacgtg gagcacctga cggagaaaat gaagaccacc 1920 

atccagagag gccttgtgct cagaaatgag agctgcagtg aaaactacac caccgacttc 1980 

atttaccagc tgtattcaga agagggcaaa ggcgtgtttg actgcaggaa gaacgtgctg 204 0 

ggtcacatgc agcagggtgg ggcaccctct ccatttgata gaaactttgg aaccaaaatc 2100 

tctgccagag ctatggagtg gatcactgca aaactcaagg aggcccgggg cagaggaaaa 2160 

aaatttacca ccgatgattc catttgtgtg ctgggaataa gcaaaagaaa cgttattttt 2220 

caacctgtgg cagagctgaa gaagcaaacg gattttgagc acaggattcc caaagaacag 2280 

tggtggctca agctacggcc cctcatgaaa atcctggcca agtacaaggc cagctatgac 234 0 

gtgtcggact caggccagct ggaacatgtg cagccctgga gtgtctgacc cagtcccgcc 2400 

tgcatgtgcc tgcagccacc gtggactgtc tgtttttgta acacttaagt tattttatca 2460 

gcactttatg cacgtattat tgacattaat acctaatcgg cgagtgccca tctgccccac 2520 

cagctccagt gcgtgctgtc tgtggagtgt gtctcatgct ttcagatgtg catatgaqca 2580 
gaattaatta a 2591 

<210> 26 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 
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<308> D25328 



<400> 26 

tattttatca gcactttatg cacgtattat tgacattaat acctaatcgg cgagtgccca 60 

<210> 27 

<211> 2573 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> D50402 



<400> 27 

gaatcggccg atgtgaaccg aatgttgatg 
taacagggcg tgggctggca cacttacttg 
aggagctgcc cagagcaccg ctcacactcc 
aggtgacaag ggtccccaaa ggctaagcgg 
cagcccgacc agcccagggc cacggcaagc 
gatccccatc ccagacacaa aaccgggcac 
ggggcctggc ttcctcatga gcattgcttt 
tcaggctggc gccgtggcgg gattcaaact 
gggcttgctc tgccagcgac tggctgcacg 
cgaggtctgc catctctact accctaaggt 
gctagccatt gtgggctccg acatgcagga 
gctctcagct ggacgaatcc cactctgggg 
cttcttcctc ttcctcgata actacgggct 
tataaccatt atggccttga cctttggcta 
agcgcttctt cgg'ggcctgt tcctgccctc 
gcaggcggtg ggcattgttg gcgccatcat 
cctggtcaag tctcgagaga tagaccgggc 
gtacttcctg attgaggcca ccatcgccct 
catggctgtc tttgggcagg ccttctacca 
tgccaacagc agcctccacg actacgccaa 
cgtggacatt taccaggggg gcgtgatcct 
catctgggcc ataggtctcc tggcggctgg 
gggacagttc gtgatggagg gcttcctgag 
cctcacccgc tcctgcgcca tcctgcccac 
ggacttgtcg ggcctcaatg atctgctcaa 
cgtgctgccc atcctcacgt tcaccagcat 
cctgctgaac aaggtcgtca cctcttccat 
cttcgtggtc agctatctgc ccagcctgcc 
gctggccgca gcctacctgg gcctcagcac 
cggagccacc tttctggccc acagctccca 
ggaccagaaa ggggagacct ctggctaggc 
tatgacgtga ctggcctgct ggatgtggag 
gacagttcct gagaccagcc aacctggggg 
catgtgatta ccctctgggt ctcagtgtcc 
tgccatggag gttaagcact ttaacacagt 
aaacaaaaaa catttcaaaa ggtatttatt 
agggtgggtg gggtgagggc ttgaggactt 
gaaatagtgt ctgatgaatg ttaaattatc 
agggaaaggg tctccctctg ttgccaaggc 
gcaacctcca ccttctgggt tcaagcgatt 
ccaccacgcc cagctaattt gtgtattttc 
gctggtctcg aactgctgga ttcaagtgat 
aattacaggc gtgagccacc aaaacccggc 

<210> 28 
<211> 60 



taagaggcag ggcactcggc tgcggatggg 60 

caccagtgcc cagagagggg gtgcaggctg 120 

cagagtacct gaagtcggca tttcaatgac 180 

gtccagctat ggttccatct ccagcccgac 240 

acctcccaga gagacctacc tgagtgagaa 300 

cttcagcctg cggaagctat gggccttcac 360 

cctggaccca ggaaacatcg agtcagatct 420 

tctctgggtg ctgctctggg ccaccgtgtt 480 

tctgggcgtg gtgacaggca aggacttggg 540 

gccccgcacc gtcctctggc tgaccatcga 600 

agtcatcggc acggccattg cattcaatct 660 

tggcgtcctc atcaccatcg tggacacctt 720 

gcggaagctg gaagcttttt ttggactcct 780 

tgagtatgtg gtggcgcgtc ctgagcaggg 84 0 

gtgcccgggc tgcggccacc ccgagctgct 900 

catgccccac aacatctacc tgcactcggc 960 

ccgccgagcg gacatcagag aagccaacat 1020 

gtccgtctcc tttatcatca acctctttgt 1080 

gaaaaccaac caggctgcgt tcaacatctg 1140 

gatcttcccc atgaacaacg ccaccgtggc 1200 

gggctgcctg ttcggccccg cggccctota 1260 

gcagagctcc accatgacgg gcacctacgc 1320 

gctgcggtgg tcacgcttcg cccgtgtcct 1380 

cgtgctcgtg gctgtcttcc gggacctgag 1440 

cgtgctgcag agcctgctgc tcccgttcgc 1500 

gcccaccctc atgcaggagt ttgccaatgg 1560 

catggtgcta gtctgcgcca tcaacctcta 1620 

ccaccctgcc tacttcggcc ttgcagcctt 1680 

ctacctggtc tggacctgtt gccttgccca 1740 

ccaccacttc ctgtatgggc tccttgaaga 1800 

ccacaccagg gcctggctgg gagtggcatg 1860 

ggggcgcgtg caggcagcag gatggagtgg 1920 

ctttagggac ctgctgtttc ctagcgcagc 1980 

tcatctgtaa aatggagacg ccaccaccct 2040 

gtctggcact tgggacaaaa acaaacaaac 2100 

gagcacctgc aggcgtgacc tgacagccca 2160 

gggcgggaca caggctccaa actggagctt 2220 

tatctatcta tttatttatt tatttgagac 2280 

tggagtgcag tggcgcaatc ttaactcatt 2340 

ctctttattc agccccggga gtggcgcgcg 2400 

agcagagacg gggtttgcca tgctggccag 24 60 

ccgcccatct ccgtctccca aagtgctggg 2520 
ctgattaaag ttaaataaat acg 2573 
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<212> DNA 

<213> Homo sapiens 

<300> 

<308> D50402 
<400> 28 

tggaggttaa gcactttaac acagtgtctg gcacttggga caaaaacaaa caaacaaaca 60 

<210> 29 

<211> 3672 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> L27560 
<400> 29 

acatgtgcat atttcattcc ccaggcagac attttttaga aatcaataca tgccccaata 60 

ttggaaagac ttgttcttcc acggtgacta cagtacatgc tgaagcgtgc cgtttcagcc 120 

ctcatttaat tcaatttgta agtagcgcac gagcctctgt gggggaggat aggctgaaaa 180 

aaaaaagtgg gctcgtattt atctacagga ctccatatag tcatatatag gcatataaat 240 

ctatgctttt tctttgtttt tttctttctt cctttctttc aaaggtttgc attaactttt 300 

caaagtagtt cctatagggg cattgaggag cttcctcatt ctgggaaaac tgagaaaacc 360 

catattctcc taatacaacc cgtaatagca tttttgcctg cctcgaggca gagtttcccg 420 

tgagcaataa actcagcttt tttgtggggc acagtactgg atttgacagt gattccccac 4 80 

gtgtgttcat ctgcacccac cgagccaggc agaggccagc cctccgtggt gcacacagca 540 

cgcgcctcag tccatcccat tttagtcttt aaaccctcag gaagtcacag tctccggaca 600 

ccacaccaca ttgagcccaa caggtccacg atggatccac ctagtcccac cccagccttt 660 

ttctttcatc tgaacagaat gtgcattttt ggaagcctcc ctcactctcc atgctggcag 720 

agcaggaggg agactgaagt aagagatggc agagggagat ggtggcaaaa aggtttagat 780 

gcaggagaac agtaagatgg atggttccgg ccagagtcga tgtggggagg aacagagggc 840 

tgaagggaga gggggctgac tgttccattc tagctttggc acaaagcagc agaaaggggg 900 

aaaagccaat agaaatttcc ttagcttccc caccatatgt attttcatgg atttgagagg 960 

aaagagagga aaatggggga atgggttgca aaatagaaat gagcttaatc caggccgcag 1020 

agccagggaa ggtgagtaac cttaggaggg tgctagactt tagaagccag ataggaagaa 1080 

tcagtctaaa ctggccatgc tttggaaggg acaagactat gtgctccgct gcccaccttc 1140 

agcctgcaat gagggactga ggcccacgag tctttccagc tcttcctcca ttctggccag 1200 

tccctgcatc ctccctgggg tggaggatgg aaggaaagct gggacaagca gggaacgcat 1260 

gattcaggga tgctgtcact cggcagccag attccgaaac tcccattctc caatgacttc 1320 

ctcaaccaat gggtggcctt gtgactgttc tttaaggctg aagatatcca ggaaaggggg 1380 

cttggacact ggccaaggag accccttcgt gctgtggaca cagctctctt cactctttgc 1440 

tcatggcatg acacagcgga gaccgcctcc aacaacgaat ttggggctac gaagaggaat 1500 

agcgaaaaag caaatctgtt tcaactgatg ggaaccctat agctatagaa cttgggggct 1560 

atctcctatg cccctggaca ggacagttgg ctggggacag gagaagtgct caatcttcat 1620 

gagacaaagg ggcccgatca aggcagccac aaggccttga cctgccgagt cagcatgccc 1680 

catctctctc gacagctgtc ccctaaaccc aactcacgtt tctgtatgtc ttaggccagt 1740 

atcccaaacc tcttccacgt cactgttctt tccacccatt ctccctttgc atcttgagca 1800 

gttatccaac taggatctgc caagtggata ctggggtgcc actcccctga gaaaagactg 1860 

agccaggaac tacaagctcc ccccacattc ctcccagcct ggacctaatt cttgagaggg 1920 

gctctctctt cacggactgt gtctggactt tgagcaggct tctgcccctt gcgttggctc 1980 

tttgctgcca gccatcaggt gggggattag agcctggtgt aagtgcgcca gactcttccg 204 0 

gtttccaaag ttcgtgcctg cgaacccaaa cctgtgagtc tcttctgcat gcaggagttt 2100 

ctcctgggca gctggtcact ccccagagaa gctgggcctt catggacaca tggaactaag 2160 

cctcccaaat gggagttctg gctgagccca gggtggggag atcctgggaa gggaggcact 2220 

ggaggaagac ggcacctctt cccccatggc agggtgtgag ggaggcaggt ttggaatggt 2280 

gcgagtatgg caatctaagc aggggtctgg tctctttgac tccaggctcg ctttggccga 234 0 

ctgtctgctc acccagagac cttggactcc ggactatcca tggctccgaa tctaagtgct 2400 

gcccactccc atgctcacac ccacagaagg tcttcccatc ccctttagat tcgtgcctca 24 60 

ctccaccagt gaggaagatg cctctgtctt tcccacgact gccaggagat agggaagccc 2520 

agccaggact gaccctcctt cctccagcct gccctgaccc acctggcaaa gcagggcaca 2580 
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tggggaggaa gagactggaa cctttctttg acagccaggc ctagacagac aggcctgggg 2640 

acactggccc atgaggggag gaaggcaggc gcacgaggtc cagggaggcc cttttctgat 2700 

catgcccctt ctctcccacc ccatctcccc accaccacct ctgtggcctc catggtaccc 27 60 

ccacagggct ggcctcccct agagggtggg cctcaaccac ctcgtcccgc cacgcaccgg 2820 

ttagtgagac agggctgcca cgcaaccgcc aagcccccct caaggtggga cagtaccccg 2880 

gacccatcca ctcactcctg agaggctccg gcccagaatg ggaacctcag agaagagctc 2940 

taaggagaag aaaccccata gcgtcagaga ggatatgtct ggcttccaag agaaaggagg 3000 

ctccgfctttg caaagtggag gagggacgag ggacaggggt ttcaccagcc agcaacctgg 3060 

gccttgtact gtctgtgttt ttaaaaccac taaagtgcaa gaattacatt gcactgtttc 3120 

tccacttttt attttctctt aggcttttgt ttctatttca aacatacttt cttggttttc 3180 

taatggagta tatagtttag tcatttcaca gactctggcc tcctctcctg aaatcctttt 3240 

ggatggggaa agggaaggtg gggagggtcc gaggggaagg ggaccccagc ttccctgtgc 3300 

ccgctcaccc cactccacca gtccccggtc gccagccgga gtctcctctc taccgccact 3360 

gtcacaccgt agcccacatg gatagcacag ttgtcagaca agattccttc agattccgag 34 20 

ttgctaccgg ttgttttcgt tgttgttgtt gttgtttttc tttttctttt tttttttgaa 3480 

gacagcaata accacagtac atattactgt agttctctat agttttacat acattcatac 3540 

cataactctg ttctctcctc ttttttgttt tcaactttaa aaacaaaaat aaacgatgat 3600 

aatctttact ggtgaaaagg atggaaaaat aaatcaacaa atgcaaccag tttgtgagaa 3660 
aaaaaaaaaa aa 3672 

<210> 30 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> L27560 
<400> 30 

agcaacctgg gccttgtact gtctgtgttt ttaaaaccac taaagtgcaa gaattacatt 60 

<210> 31 
<211> 1416 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> Modifiedjoase 
<222> 1 . . . 1416 
<223> n = a,c,g, or t 

<300> 

<308> M55914 
<400> 31 

aggaattccg gaattccgga attccgatgg atggaacaga aaataaatct 
cgaacgccat tctgggggtg tcccttgccg tctgcaaagc tggtgccgtt 
tcccctgtac cgccacatcg cgtacttggc tggcaacttc gaagtcatcc 
ggcgttcaag tgtcatcatc aatggcggtt ctcatgctgg caacaagctg 
gtctgtcctc ccagtcggtg cagcaaactc agggaagcca tgccgcattg 
ttaccacaac ctgaagaatg tcatcaagga gaaatatggg aaagatgcca 
gatttgcgcg ggtttgctcc caacatcctg gagaataaag aaggcctgga 
actgctattg gaaagcctgg cctacactgt aaaggtggtc atggcatgga 
tccgagttct tcaggtcagg gaactatgac ctggacttca agtctcccga 
aggtacatct cgcctgacca gctggctgac ctgtacaagt ccttcatcaa 
gtggtgtcta tcgaagatcc ctttgaccag gatgactggg gagcttcaga 
cagtgcagga atccaggtag tggggggatg actcacagtg accaacccaa 
caaggcgtga acgagaagtc ctgcaactgc ctcctgctca aagtcaacca 
gtgaccgagt ctcttcaggc gtgcaagctg gcccaggcca atggttgggg 
tctcatcgtt cgggggagac tgaagatacc ttcatcgctg acctggttgt 
actggggcag atcaagactg gtgccccttg ccgatcacgc gcttggccaa 



aagtttggtg 60 

gagaaggggg 120 

tgccagtccc 180 

gccatgcaga 240 

gagcagaggt 300 

ccaatgtggg 360 

gctgctgaag 420 

cgtagcggcc 480 

tgaccccagc 540 

ggactaccca 600 

agttcacagc 660 

agaggatcgc 720 

gattggctcc 780 

cgtcatggtg 840 

ggggctgtgc 900 

gtacaaccag 960 
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ctcctcagaa ttgaagagga gctgggcagc aaggctaagt ttgccggcag gaacttcaga 1020 

aaccccttgg ccaagtaagc tgtgggcagg caagccttcg gtcacctgtt ggctacagac 1080 

ccctcccctg gtgtcagctc aggcagctcg aggcccccga ccaacacttg caggggtccc 1140 

tgctagttag cgcccaccgc cgtggagttc gtaccgcttc cttagaactc tacagaagcc 1200 

aagctccctg gaagccctgt tggcagctct agctttgcag ttgtgtaatt ggcccaagtc 1260 

attgtttttc tcgccttact ttccaccaag tgtctagagt catgtgagcc tngtgtcatc 1320 

tccggggtgg ccacaggcta gatccccggt ggttttgtgc tcaaaataaa aagcctcagt 1380 

gacccatgaa aaaaaaaaag gaattccgga attccg 1416 

<210> 32 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> M55914 



<400> 32 

gtaccgcttc cttagaactc tacagaagcc aagctccctg gaagccctgt tggcagctct 60 

<210> 33 

<211> 2517 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> M96577 
<400> 33 

ggaattccgt ggccgggact ttgcaggcag cggcggccgg gggcggagcg ggatcgagcc 60 

ctcgccgagg cctgccgcca tgggcccgcg ccgccgccgc cgcctg.tcac ccgggccgcg 120 

cgggccgtga gcgtcatggc cttggccggg gcccctgcgg gcggcccatg cgcgccggcg 180 

ctggaggccc tgctcggggc cggcgcgctg cggctgctcg actcctcgca gatcgtcatc 240 

atctccgccg cgcaggacgc cagcgccccg ccggctccca ccggccccgc ggcgcccgcc 300 

gccggcccct gcgaccctga cctgctgctc ttcgccacac cgcaggcgcc ccggcccaca 360 

cccagtgcgc cgcggcccgc gctcggccgc ccgccggtga agcggaggct ggacctggaa 420 

actgaccatc agtacctggc cgagagcagt gggccagctc ggggcagagg ccgccatcca 480 

ggaaaaggtg tgaaatcccc gggggagaag tcacgctatg agacctcact gaatctgacc 540 

accaagcgct tcctggagct gctgagccac tcggctgacg gtgtcgtcga cctgaactgg 600 

gctgccgagg tgctgaaggt gcagaagcgg cgcatctatg acatcaccaa cgtccttgag 660 

ggcatccagc tcattgccaa gaagtccaag aaccacatcc agtggctggg cagccacacc 720 

acagtgggcg tcggcggacg gcttgagggg ttgacccagg acctccgaca gctgcaggag 780 

agcgagcagc agctggacca cctgatgaat atctgtacta cgcagctgcg cctgctctcc 840 

gaggacactg acagccagcg cctggcctac gtgacgtgtc aggaccttcg tagcattgca 900 

gaccctgcag agcagatggt tatggtgatc aaagcccctc ctgagaccca gctccaagcc 960 

gtggactctt cggagaactt tcagatctcc cttaagagca aacaaggccc gatcgatgtt 1020 

ttcctgtgcc ctgaggagac cgtaggtggg atcagccctg ggaagacccc atcccaggag 1080 

gtcacttctg aggaggagaa cagggccact gactctgcca ccatagtgtc accaccacca 1140 

tcatctcccc cctcatccct caccacagat cccagccagt ctctactcag cctggagcaa 1200 

gaaccgctgt tgtcccggat gggcagcctg cgggctcccg tggacgagga ccgcctgtcc 1260 

ccgctggtgg cggccgactc gctcctggag catgtgcggg aggacttctc cggcctcctc 1320 

cctgaggagt tcatcagcct ttccccaccc cacgaggccc tcgactacca cttcggcctc 1380 

gaggagggcg agggcatcag agacctcttc gactgtgact ttggggacct cacccccctg 1440 

gatttctgac agggcttgga gggaccaggg tttccagagt agctcacctt gtctctgcag 1500 

ccctggagcc ccctgtccct ggccgtcctc ccagcctgtt tggaaacatt taatttatac 1560 

ccctctcctc tgtctccaga agcttctagc tctggggtct ggctaccgct aggaggctga 1620 

gcaagccagg aagggaagga gtctgtgtgg tgtgtatgtg catgcagcct acacccacac 1680 

gtgtgtaccg ggggtgaatg tgtgtgagca tgtgtgtgtg catgtaccgg ggaatgaagg 1740 

tgaacataca cctctgtgtg tgcactgcag acacgcccca gtgtgtccac atgtgtgtgc 1800 

atgagtccat ctctgcgcgt gggggggctc taactgcact ttcggccctt ttgctcgtgg 1860 
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ggtcccacaa 


ggcccagggc 


agtgcctgct 


cccagaatct 


ggtgctctga 


ccaggccagg 


1920 


tggggaggct 




gggcgtgtag 


gacggugaga 


gcacttctgt 


cttaaaggtt 


1980 


ttttctgatt 


gaagctttaa 




tttatttatc 


gaggcctctt 


tggtgagcct 


2040 


ggggaatcag 


caaaagggga 


ggaggggtgt 


ggggttgata 


ccccaactcc 


ctctaccctt 


2100 


gagcaagggc 


aggggtccct 


gagctgttct 


tctgccccat 


actgaaggaa 


ctgaggcctg 


2160 


ggtgatttat 


ttattgggaa 


agtgagggag 


ggagacagac 


tgactgacag 


ccatgggtgg 


2220 


tcagatggtg 


gggtgggccc 


tctccagggg 


gccagttcag 


ggcccagctg 


ccccccagga 


2280 


tggatatgag 


atgggagagg 


tgagtggggg 


accttcactg 


atgtgggcag 


gaggggtggt 


2340 


gaaggcctcc 


ccoagcccag 


accctgtggt 


ccctcctgca 


gtgtctgaag 


cgcctgcctc 


2400 


cccactgctc 


tgccccaccc 


tccaatctgc 


actttgattt 


gcttcctaac 


agctctgttc 


2460 


cctcctgctt 


tggttttaat 


aaatattttg 


atgacgttaa 


aaaaaggaat 


tcgatat 2517 


<210> 34 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> M96577 












<400> 34 














gtaggacggt 


gagagcactt 


ctgtcttaaa 


ggttttttct 


gattgaagct 
g age 


ttaatggagc 


60 


<210> 35 














<211> 4437 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_000057 












<400> 35 














gcgcggcggc 


cgtggttgcg 


gcgcgggaag 


tttggatcct 


ggttccgtcc 


gctaggagtc 


60 


tgcgtgcgag 


gattatggct 


gctgttcctc 


aaaataatct 


acaggagcaa 


etagaaegtc 


120 


actcagccag 


aacacttaat 


aataaattaa 


gtctttcaaa 


accaaaattt 


tcaggtttca 


180 


cttttaaaaa 


gaaaacatct 


tcagataaca 


atgtatctgt 


aactaatgtg 


tcagtagcaa 


240 


aaacacctgt 


attaagaaat 


aaagatgtta 


atgttaccga 


agacttttcc 


ttcagtgaac 




ctctacccaa 


caccacaaat 


cagcaaaggg 


tcaaggactt 


ctttaaaaat 


gctccagcag 


360 


gacaggaaac 


acagagaggt 


ggatcaaaat 


cattattgcc 


agatttcttg 


cagactccga 


420 


aggaagttgt 


atgcactacc 


caaaacacac 


caactgtaaa 


gaaatccegg 


gatactgetc 


480 


tcaagaaatt 


agaatttagt 


tcttcaccag 


attctttaag 


taccatcaat 


gattgggatg 


540 


atatggatga 


ctttgatact 


tctgagactt 


caaaatcatt 


tgttacacca 


ccccaaagtc 


600 


actttgtaag 


agtaagcact 


gctcagaaat 


caaaaaaggg 


taagagaaac 


ttttttaaag 


660 


cacagcttta 


tacaacaaac 


acagtaaaga 


ctgatttgcc 


tccaccctcc 


tetgaaageg 


720 


agcaaataga 


tttgactgag 


gaacagaagg 


atgactcaga 


atggttaagc 


agcgatgtga 


780 


tttgcatcga 


tgatggcccc 


attgctgaag 


tgcatataaa 


tgaagatget 


caggaaagtg 


840 


actctctgaa 


aactcatttg 


gaagatgaaa 


gagataatag 


cgaaaagaag 


aagaatttgg 


900 


aagaagctga 


attacattca 


actgagaaag 


ttccatgtat 


tgaatttgat 


gatgatgatt 


960 


atgatacgga 


ttttgttcca 


ccttctccag 


aagaaattat 


ttctgettet 


tcttcctctt 


1020 


caaaatgcct 


tagtacgtta 


aaggaccttg 


acacat ctga 


cagaaaagag 


gatgttctta 


1080 


gcacatcaaa 


agatcttttg 


tcaaaacctg 


agaaaatgag 


tatgeaggag 


ctgaatccag 


1140 


aaaccagcac 


agactgtgac 


gctagacaga 


taagtttaca 


gcagcagctt 


attcatgtga 


1200 


tggagcacat 


ctgtaaatta 


attgatacta 


ttcctgatga 


taaactgaaa 


cttttggatt 


1260 


gtgggaacga 


actgcttcag 


cagcggaaca 


taagaaggaa 


acttctaacg 


gaagtagatt 


1320 


ttaataaaag 


tgatgccagt 


cttcttggct 


cattgtggag 


atacaggect 


gattcacttg 


1380 


atggccctat 


ggagggtgat 


tcctgcccta 


cagggaattc 


tatgaaggag 


ttaaattttt 


1440 


cacaccttcc 


ctcaaattct 


gtttctcctg 


gggactgttt 


actgactacc 


accctaggaa 


1500 


agacaggatt 


ctctgccacc 


aggaagaatc 


tttttgaaag 


gectttatte 


aatacccatt 


1560 


tacagaagtc 


ctttgtaagt 


agcaactggg 


ctgaaacacc 


aagactagga 


aaaaaaaatg 


1620 


aaagctctta 


tttcccagga 


aatgttctca 


caagcactgc 


tgtgaaagat 


cagaataaac 


1680 
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atactgcttc aataaatgac ttagaaagag aaacccaacc ttcctatgat attgataatt 1740 
ttgacataga tgactttgat gatgatgatg actgggaaga cataatgcat aatttagcag 1800 
ccagcaaatc ttccacagct gcctatcaac ccatcaagga aggtcggcca attaaatcag 1860 



ccagcaaatc ctccacayui. a^ 1 -'"-'"""' « " »- ^ . „,_„ = , i q 9 n 

tatcagaaag actttcctca gccaagacag actgtcttcc agtgtcatct actgctcaaa 1920 

atataaactt ctcagagtca attcagaatt atactgacaa gtcagcacaa aatttagcat 1980 

ccagaaatct gaaacatgag cgtttccaaa gtcttagttt tcctcataca aaggaaatga 2040 

tgaagatttt tcataaaaaa tttggcctgc ataattttag aactaatcag ctagaggcga 2100 

tcaatgctgc actgcttggt gaagactgtt ttatcctgat Q^gactgga Sgggtaaga 2160 

gtttgtgtta ccagctccct gcctgtgttt ctcctggggt cactgttgtc atttctccct 2220 

?gagatcact tatcgtagat caagtccaaa agctgacttc cttggatatt ccagctacat 2280 

a?c?gacagg tgataagact gactcagaag ctacaaatat ttacctccag ttatcaaaaa 2340 

aagacccaa? cataaaactt ctatatgtca ctccagaaaa gatctgtgca agtaacagac 2400 

tcatttctac tctggagaat ctctatgaga ggaagctctt ggcacgtttt Qttattgatg 2460 

aagcacattg tgtcagtcag tggggacatg attttcgtca agattacaaa agaatgaata 2520 

tgcttcgcca gaagtttcct tctgttccgg tgatggctct tacggccaca ^aatccca 2580 

gggtacagaa ggacatcctg actcagctga agattctcag acctcaggtg "tagcatga 2640 

gSttaacag acataatctg aaatactatg tattaccgaa aaagcctaaa aaggtggcat 2700 

?tgattgoct agaatggatc agaaagcacc acccatatga ttcagggata atttactgcc 2760 

tctccaggcg agaatgtgac accatggctg acacgttaca gagagatggg -tcgctgctc 2820 

ttgcttacca tgctggcctc agtgattctg ccagagatga agtgcagcag aagtggatta 2880 

atcaggatgg ctgtcaggtt atctgtgcta caattgcatt tggaatgggg attgacaaac 2940 

cggacgtgcg atttgtgatt catgcatctc tccctaaatc tgtggagggt tactaccaag 3000 

aaLtggcag agctggaaga gatggggaaa tatctcactg cctgcttttc tatacctatc 3060 

atgatgtgac cagactgaaa agacttataa tgatggaaaa agatggaaac catcatacaa 3120 

gagaaac?ca ctLaataat ttgtatagca tggtacatta ctgtgaaaat ataacggaat 3180 

gcaggagaat acagcttttg gcctactttg gtgaaaatgg atttaatcct gatttttgta 3240 

agaaacaccc agatgtttct tgtgataatt gctgtaaaac aaaggattat "aacaagag 3300 

atgtgactga cgatgtgaaa agtattgtaa gatttgttca agaacatagt tcatcacaag 3360 

gaatgagaaa tataaaacat gtaggtcctt ctggaagatt tactatgaat atgctggtcg 3 20 

acattttctt ggggagtaag agtgcaaaaa tccagtcagg tatatttgga aaaggatctg 3480 

££££ 224^ ,Uact« s »«2" llll 



cttattcacg acacaangcc gddaya^uu-. u _ finn 

atgaagactt atatatcaat gccaatgacc aggcgatcgc ttatgtgatg ctcggaaata 3600 

ccccagggat aagcctgtcc agcagcagag gccccggaag aagtgccgct gaggagcttg 4020 

■a-3'3 i.4.„-,__„vaa aafr-anaaah aaaaaaaaaa 



atgaagactt atataicaai a ^at.i.<j=v.v 3 _ " , „ = 

aagcccaaac tgtactaaat ggcaatttaa aggtagactt tatggaaaca gaaaattcca 3660 
i.j j „,o = ^^,t-^Tt-/- tfanaminas aacrataatta 

atcctgaggt tttgcttcaa attgatggtg ttactgaaga caaactggaa aaatatggtg 3900 



aagcccaaac cgtaccadai. yy ,_aa^v.~- —a* — a , 

gcagtgtgaa aaaacaaaaa gcgttagtag caaaagtgtc tcagagggaa gagatggtta 3720 

aaaaatgtct tggagaactt acagaagtct gcaaatctct ggggaaagtt "tggtgtcc 3780 

attacttcaa tatttttaat accgtcactc tcaagaagct tgcagaatct ttatcttctg 3840 



atcctgaggt utgctttdd ol^ov.^^ ..v.^-..-,— 3 _ , 0£n 

cggaaqtgat ttcagtatta cagaaatact ctgaatggac atcgccagct gaagacagtt 3960 

ccccagggat aagcctgtcc agcagcagag gccccggaag aagtgccgct gaggagcttg 4020 

acgaggaaat acccgtatct tcccactact ttgcaagtaa aaccagaaat gaaaggaaga 4080 

ggaaSagat gccagcctcc caaaggtcta agaggagaaa aactgcttcc agtggttcca 4140 

aggcaaaggg ggggtctgcc acatgtagaa agatatcttc caaaacgaaa tcctccagca 4200 

tcattggatc cagttcagcc tcacatactt ctcaagcgac atcaggagcc aatagcaaat 4260 

tggggattat ggctccaccg aagcctataa atagaccgtt tcttaagcct tcatatgcat 4320 

Staaca accgaatctc aatgtacata gaccctcttt cttgtttgtc agcatctgac 4380 
catctgtgac tataaagctg ttattcttgt tataccaaaa aaaaaaaaaa aaaaaaa 4437 



<210> 36 

<211> 60 

<212> DNA. 

<213> Homo sapiens 

<300> 

<308> NM 000057 



<400> 36 ...... 

taagccttca tatgcattct cataacaacc gaatctcaat gtacatagac cctctttctt 



<210> 37 
<211> 2016 
<212> DNA 
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<213> Homo sapiens 
<300> 

<308> NM_000060 
<400> 37 

gccagctgga gcgttttcgg ggctgtaaag ggagaatggc gcatgcgcat attcagggcg 60 
gaaggcgcgc taagagcaga tttgtggtct gcattatgtc tggagccaga agtaagcttg 120 
ctcttttcct ctgcggctgt tacgtggttg ccctgggagc ccacaccggg gaggagagcg 180 
tggctgacca tcacgaggct gaatattatg tggctgccgt gtatgagcat ccatccatcc 240 
tgagtctgaa ccctctggct ctcatcagcc gccaagaggc cttggagctc atgaaccaga 300 

accttgacat ctatgaacag caagtgatga ctgcagccca aaaggatgta cagattatag 360 

tgtttccaga agatggcatt catggattca actttacaag aacatccatt tatccatttt 420 

tggacttcat gccgtctccc caggtggtca ggtggaaccc atgcctggag cctcaccgct 4 80 

tcaatgacac agaggtgctc cagcgcctga gttgtatggc catcagggga gatatgttct 540 

tggtggccaa tcttgggaca aaggagcctt gtcatagcag tgacccaagg tgcccaaaag 600 

atgggagata ccagttcaac acaaatgtcg tgttcagcaa taatggaacc cttgttgacc 660 

gctaccgtaa acacaacctc tactttgagg cagcattcga tgttcctctt aaagtggatc 720 

tcatcacctt tgataccccc tttgctggca ggtttggcat cttcacatgc tttgatatat 780 

tgttotttga ccctgccatc agagtcctca gagactacaa ggtgaagcat gttgtgtacc 840 

caactgcctg gatgaaccag ctcccactct tggcagcaat tgagattcag aaagcttttg 900 

ctgttgcctt tggcatcaac gttctggcag ctaatgtcca ccacccagtt ctggggatga 960 

caggaagtgg catacacacc cctctggagt ccttttggta ccatgacatg gaaaatccca 1020 

aaagtcacct tataattgcc caggtggcca aaaatccagt gggtctcatt ggtgcagaga 1080 

atgcaacagg tgaaacggac ccatcccata gtaagttttt aaaaattttg tcaggcgatc 1140 

cgtactgtga gaaggatgct caggaagtcc actgtgatga ggccaccaag tggaacgtga 1200 

atgctcctcc cacatttcac tctgagatga tgtatgacaa tttcaccctg gtccctgtct 1260 

ggggaaagga aggctatctc cacgtctgtt ccaatggcct ctgctgttat ttactttacg 1320 

agaggcccac cttatccaaa gagctgtatg ccctgggggt ctttgatggg cttcacacag 1380 

tacatggcac ttactacatc caagtgtgtg ccctggtcag gtgtgggggt cttggcttcg 1440 

acacctgcgg acaggaaatc acagaggcca cggggatatt tgagtttcac ctgtggggca 1500 

acttcagtac ttcctatatc tttcctttgt ttctgacctc agggatgacc ctagaagtcc 1560 

ctgaccagct tggctgggag aatgaccact atttcctgag gaaaagtagg ctgtcctctg 1620 

ggctggtgac ggcggctctc tatgggcgct tgtatgagag ggactaggaa aagtgtgtgg 1680 

tctgtggggc ggactctggc catcatgttg acagccttgc acttccacag gctacaagcc 1740 

ctgggaccat ctttctgcct taagggcagg agcccacttc tgtggcacca gattccaccc 1800 

tgggaactgt ggaaaaagta ggagaggcag attccctcag tgtcttcctc ttaaacctca 1860 

atcatcgaga cattaggggg tattttctgt tcacatttat ctttttcaag ccacatcttc 1920 

ctctaacaaa tctctcagta tgcgattggt ctcaagctaa aacaaaaata aatgtcagtt 1980 
tatattttac acatccaaaa aaaaaaaaaa aaaaaa 2016 

<210> 38 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000060 
<400> 38 

tcctctaaca aatctctcag tatgcgattg gtctcaagct aaaacaaaaa taaatgtcag 60 

<210> 39 

<211> 811 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM 000269 
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<400> 39 

gcagaagcgt 

tgctggcggc 

gagcgtacct 

atcaagcgtt 

gaagatcttc 

gtgaaataca 

aagacgggcc 

cgtggagact 

agtgcagaga 

tgtgctcaga 

tttcccctcc 

cttcatcata 

atccccgacc 

tcatattgtt 



tccgtgcgtg 
tgcagccgga 
tcattgcgat 
ttgagcagaa 
tcaaggaaca 
tgcactcagg 
gagtcatgct 
tctgcataca 
aggagatcgg 
actggatcta 
ttcccatggg 
atttggaggg 
atctgattaa 
gcattgcttt 



caagtgctgc 
gttcaaacct 
caaaccagat 
aggattccgc 
ctacgttgac 
gccggtagtt 
cggggagacc 
agttggcagg 
cttgtggttt 
tgaatgacag 
cagaggacca 
aagctcttgg 
aatgcttcct 
tttttccttc 



gaaccacgtg 
aagcagctgg 
ggggtccagc 
cttgttggtc 
ctgaaggacc 
gccatggtct 
aaccctgcag 
aacattatac 
caccctgagg 
gagggcagac 
ggctgtagga 
agctgtgagt 
cccagcatag 
t 811 



ggtcccgggc 
aaggaaccat 
ggggtcttgt 
tgaaattcat 
gtccattctt 
gggaggggct 
actccaagcc 
atggcagtga 
aactggtaga 
cacattgctt 
aatctagtta 
tctccctgta 
gattcattga 



gcgtttcggg 
ggccaactgt 
gggagagatt 
gcaagcttcc 
tgccggcctg 
gaatgtggtg 
tgggaccatc 
ttctgtggag 
ttacacgagc 
ttcacatcca 
tttacaggaa 
cagtgttacc 
gttggttact 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



<210> 40 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000269 
<400> 40 

gtctgaaatt catgcaagct tccgaagatc ttctcaagga acactacgtt gacctgaagg 

<210> 41 

<211> 2338 

<212> DNA 

<213> Homo sapiens 



<400> 41 

agcgcacgtc 

atttccaaaa 

cgggtcgtta 

cagaggatta 

gtagtcctta 

ttagagccag 

gactgtgtag 

ctgctggaga 

aacaaggtta 

ggggatgtct 

ggagtcaatc 

gcaaaggcct 

gcagacaaga 

ggtggaatgg 

tttgatgaag 

gtgaagatta 

actggccaag 

cctgaaagca 

ggtcctgtgg 

gaggtggtga 

acttgctgtg 

gccagtttgg 

tagtactttc 

catctccact 

tgcagtgcca 



ggcagtcggc 
tgtcgctttc 
tgagagtcga 
aggctgctgt 
tgagccacct 
ttgctgtaga 
gcccagaagt 
acctccgctt 
aagccgagcc 
atgtcaatga 
tgccacagaa 
tggagagccc 
tccagctcat 
cttttacctt 
agggagccaa 
ccttgcctgt 
ccactgtggc 
gcaagaagta 
gggtatttga 
aagccacttc 
ccaaatggaa 
agctcctgga 
ctgcctttta 
tggcattagc 
ggaaccctta 



tccctcgttg 
taacaagctg 
cttcaatgtt 
cccaagcatc 
aggccggcct 
actcaaatct 
ggagaaagcc 
tcatgtggag 
agccaaaata 
tgcttttggc 
ggctggtggg 
agagcgaccc 
caataatatg 
ccttaaggtg 
gattgtcaaa 
tgactttgtc 
ttctggcata 
tgctgaggct 
atgggaagct 
taggggctgc 
cacggaggat 
aggtaaagtc 
gttcctgtgc 
taaaaccttc 
aacagttgca 



accgaatcac 
acgctggaca 
cctatgaaga 
aaattctgct 
gatggtgtgc 
ctgctgggca 
tgtgccaacc 
gaagaaggga 
gaagctttcc 
actgctcaca 
tttttgatga 
ttcctggcca 
ctggacaaag 
ctcaacaaca 
gacctaatgt 
actgctgaca 
cctgctggct 
gtcactcggg 
tttgcccggg 
atcaccatca 
aaagtcagcc 
cttcctgggg 
acagccccta 
catgtcaaga 
cagcatctca 



cgacctctct 
agctggacgt 
acaaccagat 
tggacaatgg 
ccatgcctga 
aggatgttct 
cagctgctgg 
agggaaaaga 
gagcttcact 
gagcccacag 
agaaggagct 
tcctgggcgg 
tcaatgagat 
tggagattgg 
ccaaagctga 
agtttgatga 
ggatgggctt 
ctaagcagat 
gaaccaaagc 
taggtggtgg 
atgtgagcac 
tggatgctct 
agtcaactta 
ttcagctagt 
gctcatcttc 



ccccagctgt 
taaagggaag 
aacaaacaac 
agccaagtcg 
caagtactcc 
gttcttgaag 
gtctgtcatc 
tgcttctggg 
ttccaagcta 
ctccatggta 
gaactacttt 
agctaaagtt 
gattattggt 
cacttctctg 
gaagaatggt 
gaatgccaag 
ggactgtggt 
tgtgtggaat 
tctcatggat 
agacactgcc 
tgggggtggt 
cagcaatatt 
gcattttctg 
ggccaagaga 
actgcaccct 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 
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ggatttgcat acattcttca agatcccatt tgaatttttt agtgactaaa ccattgtgca 1560 

ttctagagtg catatattta tattttgcct gttaaaaaga aagtgagcag tgttagctta 1620 

gttctctttt gatgtaggtt attatgatta gctttgtcac tgtttcacta ctcagcatgg 1680 

aaacaagatg aaattccatt tgtaggtagt gagacaaaat tgatgatcca ttaagtaaac 1740 

aataaaagtg tccattgaaa ccgtgatttt tttttttttc ctgtcatact ttgttaggaa 1800 

gggtgagaat agaatcttga ggaacggatc agatgtctat attgctgaat gcaagaagtg 1860 

gggcagcagc agtggagaga tgggacaatt agataaatgt ccattcttta tcaagggcct 1920 

actttatggc agacattgtg ctagtgcttt tattctaact tttattttta tcagttacac 1980 

atgatcataa tttaaaaagt caaggcttat aacaaaaaag ccccagccca ttcctcccat 204 0 

tcaagattcc cactccccag aggtgaccac tttcaactct tgagtttttc aggtatatac 2100 

ctccatgttt ctaagtaata tgcttatatt gttcacttcc ttttttttta ttttttaaag 2160 

aaatctattt cataccatgg aggaaggctc tgttccacat atatttccac ttcttcattc 2220 

tctcggtata gttttgtcac aattatagat tagatcaaaa gtctacataa ctaatacagc 2280 

tgagctatgt agtatgctat gattaaattt acttatgtaa aaaaaaaaaa aaaaaaaa 2338 

<210> 42 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000291 
<400> 42 

acttagcatt ttctgcatct ccacttggca ttagctaaaa ccttccatgt caagattcag 60 

<210> 43 

<211> 787 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000363 



ctgaaggtca 


cccgggcggc 


cccctcactg 


accctccaaa 


cgcccctgtc 


ctcgccctgc 


60 


ctcctgccat 


tcccggcctg 


agtctcagca 


tggcggatgg 


gagcagcgat 


gcggctaggg 


120 


aacctcgccc 


tgcaccagcc 


ccaatcagac 


gccgctcctc 


caactaccgc 


gcttatgcca 


180 


cggagccgca 


cgccaagaaa 


aaatctaaga 


tctccgcctc 


gagaaaattg 


cagctgaaga 


240 


ctctgctgct 


gcagattgca 


aagcaagagc 


tggagcgaga 


ggcggaggag 


cggcgcggag 


300 


agaaggggcg 


cgctctgagc 


acccgctgcc 


agccgctgga 


gttgaccggg 


ctgggcttcg 


360 


cggagctgca 


ggacttgtgc 


cgacagctcc 


acgcccgtgt 


ggacaaggtg 


gatgaagaga 


420 


gatacgacat 


agaggcaaaa 


gtcaccaaga 


acatcacgga 


gattgcagat 


ctgactcaga 


480 


agatctttga 


ccttcgaggc 


aagtttaagc 


ggcccaccct 


gcggagagtg 


aggatctctg 


540 


cagatgccat 


gatgcaggcg 


ctgctggggg 


cccgggctaa 


ggagtccctg 


gacctgcggg 


600 


cccacctcaa 


gcaggtgaag 


aaggaggaca 


ccgagaagga 


aaaccgggag 


gtgggagact 


660 


ggcggaagaa 


catcgatgca 


ctgagtggaa 


tggagggccg 


caagaaaaag 


tttgagagct 


720 


gagccttcct 


gcctactgcc 


cctgccctga 


ggagggccac 


tgaggaataa 


agcttctctc 


780 


tgagctg 787 












<210> 44 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_000363 












<400> 44 














tgtggacaag 


gtggatgaag 


agagatacga 


catagaggca 


aaagtcacca 


agaacatcac 


60 



<400> 43 
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<210> 45 

<211> 1263 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000365 
<400> 45 

ggcacgagac cttcagcgcc tcggctccag cgccatggcg ccctccagga agttcttcgt 60 

tgggggaaac tggaagatga acgggcggaa gcagagtctg ggggagctca tcggcactct 120 

gaacgcggcc aaggtgccgg ccgacaccga ggtggtttgt gctcccccta ctgcctatat 180 

cgacttcgcc cggcagaagc tagatcccaa gattgctgtg gctgcgcaga actgctacaa 240 

agtgactaat ggggctttta ctggggagat cagccctggc atgatcaaag actgcggagc 300 

cacgtgggtg gtcctggggc actcagagag aaggcatgtc tttggggagt cagatgagct 360 

gattgggcag aaagtggccc atgctctggc agagggactc ggagtaatcg cctgcattgg 420 

ggagaagcta gatgaaaggg aagctggcat cactgagaag gttgttttcg agcagacaaa 480 

ggtcatcgca gataacgtga aggactggag caaggtcgtc ctggcctatg agcctgtgtg 540 

ggccattggt actggcaaga ctgcaacacc ccaacaggcc caggaagtac acgagaagct 600 

ccgaggatgg ctgaagtcca acgtctctga tgcggtggct cagagcaccc gtatcattta 660 

tggaggctct gtgactgggg caacctgcaa ggagctggcc agccagcctg atgtggatgg 720 

cttccttgtg ggtggtgctt ccctcaagcc cgaattcgtg gacatcatca atgccaaaca 780 

atgagcccca tccatcttcc ctacccttcc tgccaagcca gggactaagc agcccagaag 840 

cccagtaact gccctttccc tgcatatgct tctgatggtg tcatctgctc cttcctgtgg 900 

cctcatccaa actgtatctt cctttactgt ttatatcttc accctgtaat ggttgggacc 960 

aggccaatcc cttctccact tactataatg gttggaacta aacgtcacca aggtggcttc 1020 

tccttggctg agagatggaa ggcgtggtgg gatttgctcc tgggttccct aggccctagt 1080 

gagggcagaa gagaaaccat cctctccctt cttacaccgt gaggccaaga tcccctcaga 1140 

aggcaggagt gctgccctct cccatggtgc ccgtgcctct gtgctgtgta tgtgaaccac 1200 

ccatgtgagg gaataaacct ggcactagga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1260 
aaa 1263 

<210> 46 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000365 
<400> 46 

tatcttcacc ctgtaatggt tgggaccagg ccaatccctt ctccacttac tataatggtt 60 

<210> 47 

<211> 1616 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000582 
<400> 47 

ctccctgtgt tggtggagga tgtctgcagc agcatttaaa ttctgggagg gcttggttgt 60 

cagcagcagc aggaggaggc agagcacagc atcgtcggga ccagactcgt ctcaggccag 120 

ttgcagcctt ctcagccaaa cgccgaccaa ggaaaactca ctaccatgag aattgcagtg 180 

atttgctttt gcctcctagg catcacctgt gccataccag ttaaacaggc tgattctgga 240 

agttctgagg aaaagcagct ttacaacaaa tacccagatg ctgtggccac atggctaaac 300 

cctgacccat ctcagaagca gaatctccta gccccacaga cccttccaag taagtccaac 360 

gaaagccatg accacatgga tgatatggat gatgaagatg atgatgacca tgtggacagc 420 

caggactcca ttgactcgaa cgactctgat gatgtagatg acactgatga ttctcaccag 480 
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tctgatgagt ctcaccattc tgatgaatct gatgaactgg tcactgattt tcccacggac 540 

ctgccagcaa ccgaagtttt cactccagtt gtccccacag tagacacata tgatggccga 600 

ggtgatagtg tggtttatgg actgaggtca aaatctaaga agtttcgcag acctgacatc 660 

cagtaccctg atgctacaga cgaggacatc acctcacaca tggaaagcga ggagttgaat 720 

ggtgcataca aggccatccc cgttgcccag gacctgaacg cgccttctga ttgggacagc 780 

cgtgggaagg acagttatga aacgagtcag ctggatgacc agagtgctga aacccacagc 840 

cacaagcagt ccagattata taagcggaaa gccaatgatg agagcaatga gcattccgat 900 

gtgattgata gtcaggaact ttccaaagtc agccgtgaat tccacagcca tgaatttcac 960 

agccatgaag atatgctggt tgtagacccc aaaagtaagg aagaagataa acacctgaaa 1020 

tttcgtattt ctcatgaatt agatagtgca tcttctgagg tcaattaaaa ggagaaaaaa 1080 

tacaatttct cactttgcat ttagtcaaaa gaaaaaatgc tttatagcaa aatgaaagag 1140 

aacatgaaat gcttctttct cagtttattg gttgaatgtg tatctatttg agtctggaaa 1200 

taactaatgt gtttgataat tagtttagtt tgtggcttca tggaaactcc ctgtaaacta 1260 

aaagcttcag ggttatgtct atgttcattc tatagaagaa atgcaaacta tcactgtatt 1320 

ttaatatttg ttattctctc atgaatagaa atttatgtag aagcaaacaa aatactttta 1380 

cccacttaaa aagagaatat aacattttat gtcactataa tcttttgttt tttaagttag 1440 

tgtatatttt gttgtgatta tctttttgtg gtgtgaataa atcttttatc ttgaatgtaa 1500 

taagaatttg gtggtgtcaa ttgcttattt gttttcccac ggttgtccag caattaataa 1560 
aacataacct tttttactgc ctaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 1616 

<210> 48 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000582 
<400> 48 

ggtggtgtca attgcttatt tgttttccca cggttgtcca gcaattaata aaacataacc 60 

<210> 49 
<211> 1666 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_000584 
<400> 49 

ctccataagg cacaaacttt cagagacagc agagcacaca agcttctagg acaagagcca 
ggaagaaacc accggaagga accatctcac tgtgtgtaaa catgacttcc aagctggccg 
tggctctctt ggcagccttc ctgatttctg cagctctgtg tgaaggtgca gttttgccaa 
ggagtgctaa agaacttaga tgtcagtgca taaagacata ctccaaacct ttccacccca 
aatttatcaa agaactgaga gtgattgaga gtggaccaca ctgcgccaac acagaaatta 
ttgtaaagct ttctgatgga agagagctct gtctggaccc caaggaaaac tgggtgcaga 
gggttgtgga gaagtttttg aagagggctg agaattcata aaaaaattca ttctctgtgg 
tatccaagaa tcagtgaaga tgccagtgaa acttcaagca aatctacttc aacacttcat 
gtattgtgtg ggtctgttgt agggttgcca gatgcaatac aagattcctg gttaaatttg 
aatttcagta aacaatgaat agtttttcat tgtaccatga aatatccaga acatacttat 
atgtaaagta ttatttattt gaatctacaa aaaacaacaa ataattttta aatataagga 
ttttcctaga tattgcacgg gagaatatac aaatagcaaa attgaggcca agggccaaga 
gaatatccga actttaattt caggaattga atgggtttgc tagaatgtga tatttgaagc 
atcacataaa aatgatggga caataaattt tgccataaag tcaaatttag ctggaaatcc 
tggatttttt tctgttaaat ctggcaaccc tagtctgcta gccaggatcc acaagtcctt 
gttccactgt gccttggttt ctcctttatt tctaagtgga aaaagtatta gccaccatct 
tacctcacag tgatgttgtg aggacatgtg gaagcacttt aagttttttc atcataacat 
aaattatttt caagtgtaac ttattaacct atttattatt tatgtattta tttaagcatc 
aaatatttgt gcaagaattt ggaaaaatag aagatgaatc attgattgaa tagttataaa 
gatgttatag taaatttatt ttattttaga tattaaatga tgttttatta gataaatttc 
aatcagggtt tttagattaa acaaacaaac aattgggtac ccagttaaat tttcatttca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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gataaacaac aaataatttt ttagtataag tacattattg tttatctgaa attttaattg 1320 

aactaacaat cctagtttga tactcccagt cttgtcattg ccagctgtgt tggtagtgct 1380 

gtgttgaatt acggaataat gagttagaac tattaaaaca gccaaaactc cacagtcaat 1440 

attagtaatt tcttgctggt tgaaacttgt ttattatgta caaatagatt cttataatat 1500 

tatttaaatg actgcatttt taaatacaag gctttatatt tttaacttta agatgttttt 1560 

atgtgctctc caaatttttt ttactgtttc tgattgtatg gaaatataaa agtaaatatg 1620 
aaacatttaa aatataattt gttgtcaaag taaaaaaaaa aaaaaa 1666 

<210> 50 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NMJD00584 
<400> 50 

tggtagtgct gtgttgaatt acggaataat gagttagaac tattaaaaca gccaaaactc 60 

<210> 51 

<211> 1722 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_000599 
<400> 51 

ggggaaaaga gctaggaaag agctgcaaag 
ttttcattac ccctcctccg ttttcaccct 
tcgaagagga ggtggcaaag tgggagaaaa 
ttgtttttgt tttttaattt cttgatttca 
aacgcctctt acctgttctg cggcgccgcg 
ggtgtatttt agattttaag caaaaatttt 
caacgccatc tccactgcat ccgatctcat 
tttgtggctt tttttcccct ataattctga 
ccgctcactg cgtgcacctg gcgctgccct 
tcttgcaact gggacctgct cgcaggcacc 
aagtgtctgg gggagggcac , ctgctctacc 
aaaaaaatct ccgggggccc tcttggcccc 
accccgaggt aaagggggcg actaagagaa 
gctggccgcc tatgcggggc cggcccagag 
cgacgagaaa gccctctcca tgtgcccccc 
gccgggctgc ggctgctgca tgacctgcgc 
caccgagcgc tgcgcccagg ggctgcgctg 
gcacgccctg ctgcacggcc gcggggtttg 
caagatcgag agagactccc gtgagcacga 
gacctactcc cccaagatct tccggcccaa 
agcagtgaag aaggaccgca gaaagaagct 
gaacactgcc cacccccgga tcatctctgc 
cccctgccgc agacacatgg aggcttccct 
gccccgtgct gtgtacctgc ccaattgtga 
caaaccttcc cgtggccgca agcgtggcat 
gctgccaggc atggagtacg ttgacgggga 
cgttgagtga tgcgtccccc cccaaccttt 
tccagccagc gcctccctcc accccaggac 
atatatatct atctatttga ggaaaaaaaa 

<210> 52 
<211> 60 



cagtgtgggc tttttccctt tttttgctcc 60 

tctccggact tcgcgtagaa cctgcgaatt 120 

gaggtgttag ggtttggggt ttttttgttt 180 

acattttctc ccaccctctc ggctgcagcc 240 

caccgctggc agctgagggt tagaaagcgg 300 

aaagataaat ccatttttct ctcccacccc 360 

tatttcggtg gttgcttggg ggtgaacaat 420 

cccgctcagg cttgagggtt tctccggcct 480 

gcttccccca acctgttgca aggctttaat 540 

ccagccctcc acctctctct acatttttgc 600 

tgccagaaat tttaaaacaa aaacaaaaac 660 

tttatccctg cactctcgct ctcctgcccc 720 

gatggtgttg ctcaccgcgg tcctcctgct 780 

cctgggctcc ttcgtgcact gcgagccctg 840 

cagccccctg ggctgcgagc tggtcaagga 900 

cctggccgag gggcagtcgt gcggcgtcta 960 

cctcccccgg caggacgagg agaagccgct 1020 

cctcaacgaa aagagctacc gcgagcaagt 1080 

ggagcccacc acctctgaga tggccgagga 1140 

acacacccgc atctccgagc tgaaggctga 1200 

gacccagtcc aagtttgtcg ggggagccga 12 60 

acctgagatg agacaggagt ctgagcaggg 1320 

gcaggagctc aaagccagcc cacgcatggt 1380 

ccgcaaagga ttctacaaga gaaagcagtg 1440 

ctgctggtgc gtggacaagt acgggatgaa 1500 

ctttcagtgc cacaccttcg acagcagcaa 1560 

ccctcacccc ctcccacccc cagccccgac 1620 

gccactcatt tcatctcatt taagggaaaa 1680 
aaaaaaaaaa aa 1722 
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<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000599 
<400> 52 

ccaggacgcc actcatttca tctcatttaa gggaaaaata tatatctatc tatttgagga 60 

<210> 53 
<211> 704 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_000735 



gcagttactg agaactcata agacgaagct aaaatccctc ttcggatcca cagtcaaccg 60 

ccctgaacac atcctgcaaa aagcccagag aaaggagcgc catggattac tacagaaaat 120 

atgcagctat ctttctggtc acattgtcgg tgtttctgca tgttctccat tccgctcctg 180 

atg'tgcagga ttgcccagaa tgcacgctac aggaaaaccc attcttctcc cagccgggtg 24 0 

ccccaatact tcagtgcatg ggctgctgct tctctagagc atatcccact ccactaaggt 300 

ccaagaagac gatgttggtc caaaagaacg tcacctcaga gtccacttgc tgtgtagcta 360 

aatcatataa cagggtcaca gtaatggggg gtttcaaagt ggagaaccac acggcgtgcc 420 

actgcagtac ttgttattat cacaaatctt aaatgtttta ccaagtgctg tcttgatgac 480 

tgctgatttt ctggaatgga aaattaagtt gtttagtgtt tatggctttg tgagataaaa 540 

ctctcctttt ccttaccata ccactttgac acgcttcaag gatatactgc agctttactg 600 

ccttcctcct tatcctacag tacaatcagc agtctagttc ttttcatttg gaatgaatac 660 
agcattaagc ttgttccact gcaaataaag ccttttaaat catc 704 

<210> 54 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_00073S 



tgagataaaa ctctcctttt ccttaccata ccactttgac acgcttcaag gatatactgc 60 

<210> 55 

<211> 1342 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000799 
<400> 55 

cccggagccg gaccggggcc accgcgcccg ctctgctccg acaccgcgcc ccctggacag 60 

ccgccctctc ctccaggccc gtggggctgg ccctgcaccg ccgagcttcc cgggatgagg 120 

gcccccggtg tggtcacccg gcgcgcccca ggtcgctgag ggaccccggc caggcgcgga 180 

gatgggggtg cacgaatgtc ctgcctggct gtggcttctc ctgtccctgc tgtcgctccc 24 0 

tctgggcctc ccagtcctgg gcgccccacc acgcctcatc tgtgacagcc gagtcctgga 300 

gaggtacctc ttggaggcca aggaggccga gaatatcacg acgggctgtg ctgaacactg 360 

cagcttgaat gagaatatca ctgtcccaga caccaaagtt aatttctatg cctggaagag 420 

gatggaggtc gggcagcagg ccgtagaagt ctggcagggc ctggccctgc tgtcggaagc 480 

tgtcctgcgg ggccaggccc tgttggtcaa ctcttcccag ccgtgggagc ccctgcagct 540 

gcatgtggat aaagccgtca gtggccttcg cagcctcacc actctgcttc gggctctgcg 600 



<400> 53 



<400> 54 
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agcccagaag 


gaagccatct 


cccctccaga 


tgcggcctca 


gctgctccac 


tccgaacaat 


660 


cactgctgac 


actttccgca 


aactcttccg 


agtctactcc 


aatttcctcc 


ggggaaagct 


720 


gaagctgtac 


acaggggagg 


cctgcaggac 


aggggacaga 


tgaccaggtg 






ggcatatcca 


ccacctccct 


caccaacatt 


gcttgtgcca 


caccctcccc 


cgccactcct 


840 


gaaccccgtc 


gaggggctct 


cagctcagcg 


ccagcctgtc 


ccatggacac 


tccagtgcca 


900 


gcaatgacat 


ctcaggggcc 


agaggaactg 


tccagagagc 


aactctgaga 


tctaaggatg 


960 


tcacagggcc 






gaagcattca 


gagagcagct 


ttaaactcag 


1020 


ggacagagcc 


atgctgggaa 


gacgcctgag 




accctgcaaa 


atttgatgee 




aggacacgct 


ttggaggcga 


tttacctgtt 


ttcgcaccta 




caggatgacc 


1140 


tggagaactt 


aggtggcaag 


ctgtgacttc 


tccaggtctc 


aegggcatgg 


gcactccctt 


1200 


ggtggcaaga 


gcccccttga 


caccggggtg 


gtgggaacca 


tgaagacagg 


atgggggctg 


1260 




ctcatggggt 


ccaagttttg 


tgtattcttc 


aacctcattg 


acaagaactg 


1320 


aaaccaccaa 


aaaaaaaaaa 


aa 1342 










<210> 56 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_000799 












<400> 56 














tcatggggtc caagttttgt 


gtattcttca 


acctcattga 


caagaactga 


aaccaccaaa 


60 


<210> 57 














<211> 2722 














<212> DNA 














<213> Homo sapiens 












<300> 














<308> NMJ300917 












<400> 57 














gagcgggctg 


agggtaggaa 


qta cc etc 


cgag ggagg 


cgac ggggg 


ctgaagagcg 


60 


cgccgccctc 


tcgtcccact 


tt t 


g ga cc g 


aaaattaaat 


cttccaagat 


120 


gatctggtat 


atattaatta 






tctttggctc 


atccaggctt 


180 


ttttacttca 


attggtcaga 


gac ga 


4- 

gatccatact 


gagaaagatc 


tggtgacttc 


240 


tctgaaagat 


tatattaagg 


agaagagga 


caagttagaa 


caaataaaaa 


aatgggcaga 


300 


gaagttagat 


cggctaacta 




aaaaga cca 


gaaggatttg 


ttgggcatcc 


360 


agtaaatgca 


ttcaaattaa 


tgaaaegtet 




tggagtgagt 


tggagaatct 


420 


ggtccttaag 


gatatgtcag 


atggctttat 


ctctaaccta 




gaccagtact 




ttctaatgat 


gaagatcagg 


ttggggcagc 




a g c cc 


agga accta 


5 n 
0 


caatttggat 


acagatacca 




taatcttc a 




acaaatcttt 


600 


tctaacggct 


gaggactgct 


ttgagttggg 


caaagtggcc 


tatacagaag 


cagattatta 




ccatacggaa 


ctgtggatgg 


aacaagccct 


aaggcaactg 


gatgaaggcg 


agatttctac 




catagataaa 


gtctctgttc 


tagattattt 


gagctatgeg 




agggagacct 




ggataaggca 


cttttgctca 


caaagaagct 


tcttgaacta 


gatcctgaac 


atcagagagc 


R4n 


taatggtaac 


ttaaaatatt 


ttgagtatat 


aatggctaaa 








tgcttcagat 


gaccaatctg 


atcagaaaac 


tacaccaaag 


aaaaaagggg 


^aa aag c 

9 g gga 




ttacctgcca 


gagagacaga 


agtacgaaat 


gctgtgccgt 


gggg^ggg^- a 






ccctcggaga 


cagaaaaaac 


tettttgecg 


ctaccatgat 




atcctaaatt 


1080 


tattctggct 


ccagctaaac 


aggaggatga 


atgggacaag 


ectegtatta 


ttcgcttcca 


1140 


tgatattatt 


tctgatgcag 


aaattgaaat 


egtcaaagae 


ctagcaaaac 


caaggctgag 


1200 


ccgagctaca 


gtacatgacc 


ctgagactgg 


aaaattgacc 


acagcacagt 


acagagtatc 


1260 


taagagtgcc 


tggctctctg 


gctatgaaaa 


tcctgtggtg 


tctcgaatta 


atatgagaat 


1320 


acaagatcta 


acaggactag 


atgtttccac 


agcagaggaa 


ttacaggtag 


caaattatgg 


1380 


agttggagga 


cagtatgaac 


cccattttga 


etttgeaegg 


aaagatgagc 


cagatgettt 


1440 


caaagagctg 


gggacaggaa 


atagaattgc 


tacatggctg 


ttttatatga 


gtgatgtgtc 


1500 


tgcaggagga 


gccactgttt 


ttcctgaagt 


tggagctagt 


gtttggccca 


aaaaaggaac 


1560 


tgctgttttc 


tggtataatc 


tgtttgccag 


tggagaagga 


gattatagta 


cacggcatgc 


1620 
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agcctgtcca 


gtgctagttg 


gcaacaaatg 


ggtatccaat 


aaatggctcc 


atgaacgtgg 


1680 


acaagaattt 


cgaagacctt 


gtacgttgtc 


agaattggaa 


tgacaaacag 


gcttcccttt 


1740 


ttctcctatt 


gttgtactct 


tatgtgtctg 


atatacacat 


ttccatagtc 


ttaactttca 


1800 


ggagtttaca 


attgactaac 


actccatgat 


tgattcagtc 


atgaacctca 


tcccatgttt 


1860 


catctgtgga 


caattgctta 


ctttgtgggt 


tcttttaaaa 


gtaacacgaa 


at cat cat at 


1920 


tgcataaaac 


cttaaagttc 


tgttggtatc 


acagaagaca 


aggcagagtt 




1980 


aattttatat 


ttaaagaact 


ttttggttgg 


ataaaaacat 


aatttgagca 


tccagtttta 


2040 


gtatttcact 


acatctcagt 


tggtgggtgt 


taagctagaa 


tgggctgtgt 






aaatgcctta 


cagatgtgcc 


taggtgttct 


gtttacctag 


tgtcttactc 


tgttttctgg 


2160 


atctgaagac 


tagtaataaa 


ctaggacact 


aactgggttc 






2220 


tgatcttcta 


agttgatttt 


tttcctccca 


agtctttttt 


aaagaaagta 


tactgtattt 


2280 


taccaacccc 


ctctcttttc 


ttttagctcc 


tctgtggtga 


attaaacgta 


cttgagttaa 


2340 




tttttttttt 


ttttttaatg 


gaaagtcctg 


cataacaaca 


ctgggccttc 


2400 


ttaactaaaa 


tgctcaccac 


ttagcctgtt 


tttttatccc 


ttttttaaaa 


tgacagatga 




ttttgttcag 




gtttttctta 


gtgctaatac 


cttgcctctt 


attcctgcta 


2520 


cagcagggtg 


gtaatattgg 


cattctgatt 


aaatactgtg 


ccttaggaga 


ctggaagttt 


2580 


aaaaatgtac 


aagtcctttc 


agtgatgagg 


gaattgattt 


tttttaaaag 


tctttttctt 


2640 




aatgtttgtt 


l ccc caagac 


tctgaaatgt 


gttgtgacaa 


caatgaccta 


2700 


tttatgatct 


taaatctttt 


tt 2722 










<210> 58 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_000917 












<400> 58 














tcttactctg ttttctggat 


ctgaagacta 


gtaataaact 


aggacactaa 


ctgggttcca 


60 


<210> 59 














<211> 3236 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_001109 












<400> 59 














gacccggcca 


tgcgcggcct 


cgggctctgg 


ctgctgggcg 


c at at ct 


gc gcga 




gcccccagcc 


ggccctgggc 


cctcatggag 


cagtatgagg 


tcatatta 


gg y g 


120 


ccaggccccc 


gagtccgccg 


agctctgccc 




ta 

g 


g g ggg g 




agctacgtcc 


ttggggccac 


agggcacaac 


ttcaccctcc 




gaacagggac 




ctgctgggtt 


ccggctacac 


agagacctat 


acggctgcca 


atggctccga 






cagcctcgcg 


ggcaggacca 


ctgcttatac 


cagggccacg 


tagaggggta 




360 


gccgccagcc 


tcagcacctg 


tgccggcctc 


aggggtttct 


tccaggtggg 


ate 


420 


cacctgatcg 


agcccctgga 


tgaaggtggc 






tacc t 




gagcacctgc 


tgcagacggc 


cgggacctgc 


ggggtcagcg 


acoacaocct 


aac 

ggg g c 


540 


ctgggacccc 


ggacggcagc 






gggactctct 






gagacccgct 


acgtggagct 


gtatgtggtc 


gtggacaatg 


cagagttcca 


ga rgctgggg 


660 


agcgaagcag 


ccgtgcgtca 


tcgggtgctg 








720 


cagaaactca 


acttccgtgt 


ggtcctggtg 


ggcctggaga 


tttggaatag 


tcaggacagg 


780 


ttccacgtca 


gccccgaccc 


cagtgtcaca 


ctggagaacc 


tcctgacctg 


geaggcaegg 


840 


caacggacac 


ggcggcacct 


gcatgacaac 


gtacagctca 


tcacgggtgt 


cgacttcacc 


900 


gggactactg 


tggggtttgc 


cagggtgtcc 


gccatgtgct 


cccacagctc 


aggggctgtg 


960 


aaccaggacc 


acagcaagaa 


ccccgtgggc 


gtggcctgca 


ccatggccca 


tgagatgggc 


1020 


cacaacctgg 


gcatggacca 


tgatgagaac 


gtccagggct 


gccgctgcca 


ggaaegctte 


1080 


gaggccggcc 


gctgcatcat 


ggcaggcagc 


attggctcca 


gtttccccag 


gatgttcagt 


1140 


gactgcagcc 


aggcctacct 


ggagagcttt 


ttggagcggc 


cgcagtcggt 


gtgcctcgcc 


1200 


aacgcccctg 


acctcagcca 


cctggtgggc 


ggccccgtgt 


gtgggaacct 


gtttgtggag 


1260 
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Cyuy yy y ay C 


ate ct 




tata 5 ° 


ggaaccgctg 


ctgcaactct 




accacctgcc 


acrcta t 


gggggece g 




gtacctgctg 


ccaggagtgc 


1 ^flO 


ttctat 3990 


caact ataa 
gg gg g 


g g y g 


g gg 


acatgtgtga 


cctcgaggag 








gag geccg 


gaagacgect 


tccaggagaa 


cggcacgccc 




gc ccgggg 


gc ac gc a 


caacggggcc 


tgt cccacac 


tggcccagca 


gtgccaggcc 




<= gggggc 


cagg gggca 


ggc gecgag 


gagt cct gefc 


tctcctatga 


catcctacca 


eon 


gg g agg 




cagggc gac 


atgtgtggcg 


ttctgcagtg 


caagggtggg 


con 
1680 


cagcagcccc 


tggggcgtgc 


catctgcatc 


gtggatgtgt 


gccacgcgct 


caccacagag 


1740 


ga ggcac g 


egtatgaace 


agtgcccgag 


ggcacccggt 


gtggaccaga 


gaaggtttgc 


1800 


ggaaaggac 


g gecagga 


ettacaegtt 


tacagatcca 


gcaactgctc 


tgcccagtgc 


1860 


cacaaccatg 


gggtgtgcaa 


ccacaagcag 


gagtgecact 


gccacgcggg 


ctgggccccg 




ccccactgcg 


egaagctget 


gactgaggtg 


cacgcagcgt 


ccgggagcct 


ccccgtcctc 


1980 


gtggtggtgg 


c gg gc 


cc tggcagtt 


gtgctggtca 


ccctggcagg 


catcatcgtc 


2040 


taccgcaaag 


cccggagccg 


catcctgagc 


aggaacgtgg 


ctcccaagac 


cacaatgggg 


2100 


cgc ccaacc 


ccc g cca 


ccaggc gec 


agccgcgtgc 


eggecaaggg 


cggggctcca 


2160 


gcccca cca 


ggggccccca 


agagc gg c 


cccaccaccc 


acccgggcca 


gcccgcccga 


2220 


cacccggcc 


cctcggtggc 


c gaagagg 


ccgccccctg 


ctcctccggt 


cactgtgtcc 


2280 


g 


tcccagttcc 


tgtctacacc 


cggcaggcac 


caaagcaggt 


catcaagcca 


2340 






cccagtcaaa 


cccggggctg 


gtgcggccaa 


ccctggtcca 


2400 


gctgagggtg 


c g ggece 


aaaggttgcc 


ctgaagcccc 


ccatccagag 


gaagcaagga 


2460 


gccggagctc 


ccacagcacc 


ctaggggggc 


acctgcgcct 


gtgtggaaat 


ttggagaagt 


2520 


tgcggcagag 


aageca gcg 


ttccagcctt 


ccacggfccca 


gctagtgccg 


ctcagcccta 


2580 


gaccctgact 


ttgeaggetc 


agctgctgtt 


ctaacctcag 


taatgeatet 


acctgagagg 


2640 


ctcctgctgt 


ccacgccctc 


agccaattcc 


ttctccccgc 


cttggccacg 


tgtagcccca 


2700 


gctgtctgca 


ggcaccaggc 


tgggatgagc 


tgtgtgcttg 


cgggtgcgtg 


tgtgtgtacg 


2760 


tgtctccagg 


tggccgctgg 


tctcccgctg 


tgttcaggag 


gecacatata 


cagcccctcc 


2820 


cagccacacc 


tgcccctgct 


ctggggcctg 


ctgagccggc 


tgccctgggc 


acccggttcc 


2880 


aggcagcaca 


gacgtggggc 


atccccagaa 


agactccatc 


ccaggaccag 


gttcccctcc 


2940 


gtgctcttcg 


agagggtgtc 


agtgagcaga 


ctgcacccca 


agctcccgac 


tccaggtccc 


3000 


ctgatcttgg 


gcctgtttcc 


catgggattc 


aagagggaca 


gccccagctt 


tgtgtgtgtt 


3060 


taagcttagg 


aatgcccttt 


atggaaaggg 


ctatgtggga 


gagtcagcta 


tcttgtctgg 


3120 


ttttcttgag 


acctcagatg 


tgtgttcagc 


agggctgaaa 


gcttttattc 


tttaataatg 


3180 


agaaatgtat 


attttactaa 


taaattattg 


accgagttct 


gtagattctt 


gttaga 3236 


<210> 60 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_001109 












<400> 60 














ctttatggaa 


agggctatgt 


gggagagtca 


gctatcttgt 


ctggttttct 


tgagacctca 


60 


<210> 61 














<211> 1449 














<212> DNA 














<213> Homo sapiens 












<300> 














<308> NM_001124 












<400> 61 














ctggatagaa cagctcaagc 


cttgccactt 


cgggcttctc 


actgeagctg 


ggcttggact 


60 


tcggagtttt 


gccattgcca 


gtgggacgtc 


tgagactttc 


tccttcaagt 


acttggcaga 


120 


tcactctctt 


agcagggtct 


gcgcttcgca 


geegggatga 


agctggtttc 


cgtcgccctg 


180 


atgtacctgg gttcgctcgc 


cttcctaggc 


gctgacaccg 


ctcggttgga 


tgtcgegteg 


240 


gagtttcgaa 


agaagtggaa 


taagtgggct 


ctgagtcgtg 


ggaagaggga 


actgeggatg 


300 


tccagcagct 


accccaccgg 


gctcgctgac 


gtgaaggccg 


ggcctgccca 


gacccttatt 


360 


cggccccagg 


acatgaaggg 


tgcctctcga 


agccccgaag 


acagcagtcc 


ggatgccgcc 


420 



29 



WO 2006/084272 



PCT/US2006/004280 



cgca ccgag 


tcaagcgcta 


ccgccagagc 


atgaacaact 


tccagggcct 


ccggagcttt 




ggc gc gc 


ggg g g 


cacggtgcag 


aagctggcac 


accagatcta 


ccagttcaca 


540 


ga aaggaca 


aggacaacgt 


cgcccccagg 


agcaagatca 


gcccccaggg 


ctacggccgc 




cggcgccggc 


gctccctgcc 


cgaggccggc 


ccgggtcgga 


ctctggtgtc 


ttctaageca 


660 


caagcacacg 


ggg etc cage 


ccccccgagt 


ggaagtgctc 


cccactttct 


ttaggattta 


720 


ggcgcccatg 


gtacaaggaa 


tagtcgegea 


agcatcccgc 


tggtgcctcc 


egggacgaag 


780 


gacttcccga 


gcggtgtggg 


.gaccgggctc 


tgacagccct 


gcggagaccc 


tgagtccggg 


840 


aggcaccg c 


eggeggegag 


ctctggcttt 


gcaagggccc 


ctccttctgg 


gggcttcget 




tccttagcct 


tgctcaggtg 


caagtgcccc 


agggggeggg 


gtgeagaaga 


atccgagtgt 


960 


ttgccaggct 


taaggagagg 


agaaactgag 


aaatgaatgc 


tgagaccccc 


ggagcagggg 


1020 


tctgagccac 


agccgtgct'c 


gcccacaaac 


tgatttctca 


cggcgtgtca 


ccccaccagg 


1080 


gcgcaagcct 


cactattact 


tgaactttcc 


aaaacctaaa 


gaggaaaagt 


geaatgegtg 


1140 


ttgtacatac 


agaggtaact 


atcaatattt 


aagtttgttg 


ctgtcaagat 


tttttttgta 


1200 


acttcaaata 


tagagatatt 


tttgtacgtt 


atatattgta 


ttaagggcat 


tttaaaagca 


1260 


attatattgt 


cctcccctat 


tttaagacgt 


gaatgtctca 


gcgaggtgta 


aagttgttcg 


1320 


ccgcgtggaa 


tgtgagtgtg 


tttgtgtgca 


tgaaagagaa 


agactgatta 


cctcctgtgt 


1380 


ggaagaagga 


aacaccgagt 


ctctgtataa 


tctatttaca 


taaaatgggt 


gatatgegaa 


1440 


cagcaaacc 


1449 












<210> 62 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_001124 












<400> 62 














gaaggaaaca 


ccgagtctct 


gtataatcta 


tttacataaa 


atgggtgata 


tgegaacage 


60 


<210> 63 














<211> 1619 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_001168 












<400> 63 














ccgccagatt 


tgaatcgegg 


gacccgttgg 


cagaggtggc 


ggeggeggea 


tgggtgcccc 


60 


gacgttgccc 


cctgcctggc 


agccctttct 


caaggaocac 


cgcatctcta 


cattcaagaa 


120 


ctggcccttc 


ttggagggct 


gcgcctgcac 


cccggagcgg 


atggccgagg 


ctggcttcat 


180 


ccactgcccc 


actgagaacg 


agecagaett 


ggcccagtgt 


ttcttctget 


tcaaggagct 


240 


ggaaggctgg 


gagecagatg 


acgaccccat 


agaggaacat 


aaaaagcatt 


cgtccggttg 


300 


cgctttcctt 


tctgtcaaga 


agcagtttga 


agaattaacc 


cttggtgaat 


ttttgaaact 


360 


ggacagagaa 


agagecaaga 


acaaaattgc 


aaaggaaacc 


aacaataaga 


agaaagaatt 


420 


tgaggaaact 


gcgaagaaag 


tgcgccgtgc 


catcgagcag 


ctggctgcca 


tggattgagg 


480 


cctctggccg 


gagctgectg 


gtcccagagt 


ggctgcacca 


cttccagggt 


ttattccctg 


540 


gtgccaccag 


ccttcctgtg 


ggccccttag 


caatgtctta 


ggaaaggaga 


tcaacatttt 


600 


caaattagat 


gtttcaactg 


tgctcctgtt 


ttgtcttgaa 


agtggcacca 


gaggtgette 


660 


tgcctgtgca 


gcgggtgctg 


ctggtaacag 


tggctgette 


tctctctctc 


tctctttttt 


720 


gggggctcat 


ttttgctgtt 




ggcttaccag 


gtgagaagtg 


agggaggaag 




aaggcagtgt 


cccttttgct 


agagctgaca 


gctttgttcg 


cgtgggcaga 




840 


gtgaatgtgt 


ctggacctca 


tgttgttgag 


gctgtcacag 


tcctgagtgt 


ggacttggca 


900 


ggtgcctgtt 


gaatctgagc 


tgcaggttcc 


ttatctgtca 


cacctgtgcc 


tcctcagagg 


960 


acagtttttt 


tgttgttgtg 


tttttttgtt 


tttttttttt 


ggtagatgea 


tgacttgtgt 


1020 


gtgatgagag 


aatggagaca 


gagtccctgg 


ctcctctact 


gtttaacaac 


atggctttct 


1080 


tattttgttt 


gaattgttaa 


ttcacagaat 


agcacaaact 


acaattaaaa 


ctaagcacaa 


1140 


agccattcta 


agtcattggg 


gaaacggggt 


gaacttcagg 


tggatgagga 


gacagaatag 


1200 


agtgatagga 


agegtctgge 


agatactcct 


tttgccactg 


ctgtgtgatt 


agacaggccc 


1260 


agtgagccgc 


ggggcacatg 


ctggccgctc 


ctccctcaga 


aaaaggcagt 


ggectaaate 


1320 
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ctttttaaat gacttggctc gatgctgtgg gggactggct gggctgctgc aggccgtgtg 1380 

tctgtcagcc caaccttcac atctgtcacg ttctccacac gggggagaga cgcagtccgc 1440 

ccaggtcccc gctttctttg gaggcagcag ctcccgcagg gctgaagtct ggcgtaagat 1500 

gatggatttg attcgccctc ctccctgtca tagagctgca gggtggattg ttacagcttc 1560 

gctggaaacc tctggaggtc atctcggctg ttcctgagaa ataaaaagcc tgtcatttc 1619 

<210> 64 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM 001168 



ttcacagaat agcacaaact acaattaaaa ctaagcacaa agccattcta agtcattggg 60 

<210> 65 
<211> 1552 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_001216 

<400> 65 
gcccgtacac 
agcccctggc 
ctgtcactgc 
tcccccttgg 
agtgaagagg 
gaggatctac 
tccctgaagt 
aataatgccc 
ccgccctggc 
cgcccccagc 
ctcccgccgc 
cctcctgggc 
ctgcactggg 
cctgccgaga 
gggcgcccgg 
agtgcctatg 
caggtcccag 
tatgaggggt 
cagacagtga 
ggtgactctc 
gaggcctcct 
aattcctgcc 
accagcgtcg 
gtgagctacc 
tgtgagaagc 
atgccacttc 



accgtgtgct 
tccctctgtt 
tgcttctgat 
gaggaggctc 
attcacccag 
ctggagagga 
tagaggatct 
acagggacaa 
cccgggtgtc 
tcgccgcctt 
tcccagaact 
tagagatggc 
gggctgcagg 
tccacgtggt 
gaggcctggc 
agcagttgct 
gactggacat 
ctctgactac 
tgctgagtgc 
ggctacagct 
tccctgctgg 
tggctgctgg 
cgttccttgt 
gcccagcaga 
cagccagagg 
cttttaactg 



gggacacccc 
gatcccggcc 
gcctgtccat 
ttctggggaa 
agaggaggat 
ggatctacct 
acctactgtt 
agaaggggat 
cccagoctgc 
ctgcccggcc 
gcgcctgcgc 
tctgggtccc 
tcgtccgggc 
tcacctcagc 
cgtgttggcc 
gtctcgcttg 
atctgcactc 
accgccctgt 
taagcagctc 
gaacttccga 
agtggacagc 
tgacatccta 
gcagatgaga 
ggtagccgag 
catctgaggg 
ccaagaaatt 



acagtcagcc 
cctgctccag 
ccccagaggt 
gatgacccac 
ccacccggag 
gaagttaagc 
gaggctcctg 
gaccagagtc 
gcgggccgct 
ctgcgccccc 
aacaatggcc 
gggcgggagt 
tcggagcaca 
accgcctttg 
gcctttctgg 
gaagaaatcg 
ctgccctctg 
gcccagggtg 
cacaccctct 
gcgacgcagc 
agtcctcggg 
gccctggttt 
aggcagcaca 
actggagcct 
ggagccggta 
ttttaaaata 



gcatggctcc 
gcctcactgt 
tgccccggat 
tgggcgagga 
aggaggatct 
ctaaatcaga 
gagatcctca 
attggcgcta 
tccagtcccc 
tggaactcct 
acagtgtgca 
accgggctct 
ctgtggaagg 
ccagagttga 
aggagggccc 
ctgaggaagg 
acttcagccg 
tcatctggac 
ctgacaccct 
ctttgaatgg 
ctgctgagcc 
ttggcctcct 
gaaggggaac 
agaggctgga 
actgtcctgt 
aatatttata 



cctgtgcccc 
gcaactgctg 
gcaggaggat 
ggatctgccc 
acctggagag 
agaagagggc 
agaaccccag 
tggaggcgac 
ggtggatatc 
gggcttccag 
actgaccctg 
gcagctgcat 
ccaccgtttc 
cgaggccttg 
ggaagaaaac 
ctcagagact 
ctacttccaa 
tgtgtttaac 
gtggggacct 
gcgagtgatt 
agtccagctg 
ttttgctgtc 
caaagggggt 
tcttggagaa 
cctgctcatt 
at 1552 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



<210> 66 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM 001216 
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<400> 66 

tcctgtcctg ctcattatgc cacttccttt taactgccaa gaaatttttt aaaataaata 60 

<210> 67 

<211> 2653 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001254 
<400> 67 

gagcgcggct ggagtttgct gctgccgctg tgcagtttgt tcaggggctt gtggtggtga 60 

gtccgagagg ctgcgtgtga gagacgtgag aaggatcctg cactgaggag gtggaaagaa 120 

gaggattgct cgaggaggcc tggggtctgt gaggcagcgg agctgggtga aggctgcggg 180 

ttccggcgag gcctgagctg tgctgtcgtc atgcctcaaa cccgatccca ggcacaggct 240 

acaatcagtt ttccaaaaag gaagctgtct cgggcattga acaaagctaa aaactccagt 300 

gatgccaaac tagaaccaac aaatgtccaa accgtaacct gttctcctcg tgtaaaagcc 360 

ctgcctctca gccccaggaa acgtctgggc gatgacaacc tatgcaacac tccccattta 420 

cctccttgtt ctccaccaaa gcaaggcaag aaagagaatg gtccccctca ctcacataca 4 80 

cttaagggac gaagattggt atttgacaat cagctgacaa ttaagtctcc tagcaaaaga 540 

gaactagcca aagttcacca aaacaaaata ctttcttcag ttagaaaaag tcaagagatc 600 

acaacaaatt ctgagcagag atgtccactg aagaaagaat ctgcatgtgt gagactattc 660 

aagcaagaag gcacttgcta ccagcaagca aagctggtcc tgaacacagc tgtcccagat 720 

cggctgcctg ccagggaaag ggagatggat gtcatcagga atttcttgag ggaacacatc 780 

tgtgggaaaa aagctggaag cctttacctt tctggtgctc ctggaactgg aaaaactgcc 840 

tgcttaagcc ggattctgca agacctcaag aaggaactga aaggctttaa aactatcatg 900 

ctgaattgca tgtccttgag gactgcccag gctgtattcc cagctattgc tcaggagatt 960 

tgtcaggaag aggtatccag gccagctggg aaggacatga tgaggaaatt ggaaaaacat 1020 

atgactgcag agaagggccc catgattgtg ttggtattgg acgagatgga tcaactggac 1080 

agcaaaggcc aggatgtatt gtacacgcta tttgaatggc catggctaag caattctcac 1140 

ttggtgctga ttggtattgc taataccctg gatctcacag atagaattct acctaggctt 1200 

caagctagag aaaaatgtaa gccacagctg ttgaacttcc caccttatac cagaaatcag 1260 

atagtcacta ttttgcaaga tcgacttaat caggtatcta gagatcaggt tctggacaat 1320 

gctgcagttc aattctgtgc ccgcaaagtc tctgctgttt caggagatgt tcgcaaagca 1380 

ctggatgttt gcaggagagc tattgaaatt gtagagtcag atgtcaaaag ccagactatt 1440 

ctcaaaccac tgtctgaatg taaatcacct tctgagcctc tgattcccaa gagggttggt 1500 

cttattcaca tatcccaagt catctcagaa gttgatggta acaggatgac cttgagccaa 1560 

gaaggagcac aagattcctt ccctcttcag cagaagatct tggtttgctc tttgatgctc 1620 

ttgatcaggc agttgaaaat caaagaggtc actctgggga agttatatga agcctacagt 1680 

aaagtctgtc gcaaacagca ggtggcggct gtggaccagt cagagtgttt gtcactttca 1740 

gggctcttgg aagccagggg cattttagga ttaaagagaa acaaggaaac ccgtttgaca 1800 

aaggtgtttt tcaagattga agagaaagaa atagaacatg ctctgaaaga taaagcttta 1860 

attggaaata tcttagctac tggattgcct taaattcttc tcttacaccc cacccgaaag 1920 

tattcagctg gcatttagag agctacagtc ttcattttag tgctttacac attcgggcct 1980 

gaaaacaaat atgacctttt ttacttgaag ccaatgaatt ttaatctata gattctttaa 2040 

tattagcaca gaataatatc tttgggtctt actattttta cccataaaag tgaccaggta 2100 

gacccttttt aattacattc actacttcta ccacttgtgt atctctagcc aatgtgcttg 2160 

caagtgtaca gatctgtgta gaggaatgtg tgtatattta cctcttcgtt tgctcaaaca 2220 

tgagtgggta tttttttgtt tgtttttttt gttgttgttg tttttgaggc gcgtctcacc 2280 

ctgttgccca ggctggagtg caatggcgcg ttctctgctc actacagcac ccgcttccca 2340 

ggttgaagtg attctcttgc ctcagcctcc cgagtagctg ggattacagg tgcccaccac 2400 

cgcgcccagc taatttttta atttttagta gagacagggt tttaccatgt tggccaggct 24 60 

ggtcttgaac tcctgaccct caagtgatct gcccaccttg gcctccctaa gtgctgggat 2520 

tataggcgtg agccaccatg ctcagccatt aaggtatttt gttaagaact ttaagtttag 2580 

ggtaagaaga atgaaaatga tccagaaaaa tgcaagcaag tccacatgga gatttggagg 2640 
acactggtta aag 2653 



<210> 68 
<211> 60 
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<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001254 
<400> 68 

caaggaaacc cgtttgacaa aggtgttttt caagattgaa gagaaagaaa tagaacatgc 60 

<210> 69 

<211> 627 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001323 



gcggccgcaa gctcggcact cacggctctg agggctccga cggcactgac ggccatggcg 60 

cgttcgaacc tcccgctggc gctgggcctg gccctggtcg cattctgcct cctggcgctg 120 

ccacgcgacg cccgggcccg gccgcaggag cgcatggtcg gagaactccg ggacctgtcg 180 

cccgacgacc cgcaggtgca gaaggcggcg caggcggccg tggccagcta caacatgggc 24 0 

agcaacagca tctactactt ccgagacacg cacatcatca aggcgcagag ccagctggtg 300 

gccggcatca agtacttcct gacgatggag atggggagca cagactgccg caagaccagg 360 

gtcactggag accacgtcga cctcaccact tgccccctgg cagcaggggc gcagcaggag 420 

aagctgcgct gtgactttga ggtccttgtg gttccctggc agaactcctc tcagctccta 480 

aagcacaact gtgtgcagat gtgataagtc cccgagggcg aaggccattg ggtttggggc 540 

catggtggag ggcacttcag gtccgtgggc cgtatctgtc acaataaatg gccagtgctg 600 

cttcttgcaa aaaaaaaaaa aaaaaaa 627 

<210> 70 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001323 
<400> 70 

atcaagtact tcctgacgat ggagatgggg agcacagact gccgcaagac cagggtcact 60 

<210> 71 

<211> 1812 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001428 



tagctaggca ggaagtcggc gcgggcggcg cggacagtat ctgtgggtac ccggagcacg 60 

gagatctcgc cggctttacg ttcacctcgg tgtctgcagc accctccgct tcctctccta 120 

ggcgacgaga cccagtggct agaagttcac catgtctatt ctcaagatcc atgccaggga 180 

gatctttgac tctcgcggga atcccactgt tgaggttgat ctcttcacct caaaaggtct 24 0 

cttcagagct gctgtgccca gtggtgcttc aactggtatc tatgaggccc tagagctccg 300 

ggacaatgat aagactcgct atatggggaa gggtgtctca aaggctgttg agcacatcaa 360 

taaaactatt gcgcctgccc tggttagcaa gaaactgaac gtcacagaac aagagaagat 420 

tgacaaactg atgatcgaga tggatggaac agaaaataaa tctaagtttg gtgcgaacgc 480 

cattctgggg gtgtcccttg ccgtctgcaa agctggtgcc gttgagaagg gggtccccct 540 

gtaccgccac atcgctgact tggctggcaa ctctgaagtc atcctgccag tcccggcgtt 600 

caatgtcatc aatggcggtt ctcatgctgg caacaagctg gccatgcagg agttcatgat 660 



<400> 69 



<400> 71 
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cc cccag c 


ggtgcagcaa 


acttcaggga 


agccatgcgc 


attggagcag 


aggtttacca 


720 


caacc 9 aa 9 


aatgtcatca 


aggagaaata 


tgggaaagat 


gccaccaatg 


tgggggatga 


780 


ggggg 


gc cccaaca 


tcctggagaa 


tiaaagaaggc 


ctggagctgc 


tgaagactgc 


840 


a 


gc ggc aca 


ctgataaggt 


ggtcatcggc 


atggacgtag 


cggcctccga 


900 


g c cagg 


tctgggaagt 


atgacctgga 


cttcaagtct 


cccgatgacc 


ccagcaggta 


960 




gaccagctgg 


ctgacctgta 


caagtccttc 


atcaaggact 


acccagtggt 


1020 


gtctatcgaa 


gatccctttg 


accaggatga 


ctggggagct 


tggcagaagt 


tcacagccag 


1080 


tgcaggaatc 


caggtagtgg 


gggatgatct 


cacagtgacc 


aacccaaaga 


ggatcgccaa 


1140 


ggccgtgaac 


gagaagtcct 


gcaactgcct 


cctgctcaaa 


gtcaaccaga 


ttggctccgt 


1200 


gaccgagtct 


cttcaggcgt 


gcaagctggc 


ccaggccaat 


ggttggggcg 


tcatggtgtc 


1260 


tcatcgttcg 


ggggagactg 


aagatacctt 


catcgctgac 


ctggttgtgg 


ggctgtgcac 


1320 


tgggcagatc 


aagactggtg 


ccccttgccg 


atctgagcgc 


ttggccaagt 


acaaccagct 


1380 


cctcagaatt 


gaagaggagc 


tgggcagcaa 


ggctaagttt 


gccggcagga 


acttcagaaa 


1440 


ccccttggcc 


aagtaagctg 


tgggcaggca 


agcccttcgg 


tcacctgttg 


gctacacaga 


1500 


cccctcccct 


cgtgtcagct 


caggcagctc 


gaggcccccg 


accaacactt 


gcaggggtcc 


1560 


ctgctagtta 


gcgccccacc 


gccgtggagt 


tcgtaccgct 


tccttagaac 


ttctacagaa 


1620 


gccaagctcc 


ctggagccct 


gttggcagct 


ctagctttgc 


agtcgtgtaa 


ttggcccaag 


1680 


tcattgtttt 


tctcgcctca 


ctttccacca 


agtgtctaga 


gtcatgtgag 


cctcgtgtca- 


1740 


tctccggggt 


ggccacaggc 


tagatccccg 


gtggttttgt 


gctcaaaata 


aaaagcctca 


1800 


gtgacccatg 


ag 1812 












<210> 72 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_001428 












<400> 72 














agctctagct 


tttgcagtcg 


tgtaatgggc 


ccaagtcatt 


gtttttctcg 


cctcactttc 


60 


<210> 73 














<211> 8368 














<212> DNA 














<213> Homo sapiens 












<300> 














<308> NM_001456 












<400> 73 














gcgatccggg 


cgccaccccg 


cggtcatcgg 


tcaccggtcg 


ctctcaggaa 


cagcagcgca 


60 


acctctgctc 


cctgcctcgc 


ctcccgcgcg 


cctaggtgcc 


tgcgacttta 


attaaagggc 


120 


cgtcccctcg 


ccgaggctgc 


agcaccgccc 


ccccggcttc 


tcgcgcctca 


aaatgagtag 


180 


ctcccactct 


cgggcgggcc 


agagcgcagc 


aggcgcggct 


ccgggcggcg 


gcgtcgacac 


240 


gcgggacgcc 


gagatgccgg 


ccaccgagaa 


ggacctggcg 


gaggacgcgc 


cgtggaagaa 


300 


gatccagcag 


aacactttca 


cgcgctggtg 


caacgagcac 


ctgaagtgcg 


tgagcaagcg 


360 


catcgccaac 


ctgcagacgg 


acctgagcga 


cgggctgcgg 


cttatcgcgc 


tgttggaggt 


420 


gctcagccag 


aagaagatgc 


accgcaagca 


caaccagcgg 


cccactttcc 


gccaaatgca 


480 


gcttgagaac 


gtgtcggtgg 


cgctcgagtt 


cctggaccgc 


gagagcatca 


aactggtgtc 


540 


catcgacagc 


aaggccatcg 


tggacgggaa 


cctgaagctg 


atcctgggcc 


tcatctggac 


600 


cctgatcctg 


cactactcca 


tctccatgcc 


catgtgggac 


gaggaggagg 






caagaagcag 


acccccaagc 


agaggctcct 


gggctggatc 


cagaacaagc 


tgccgcagct 


720 


gcccatcacc 


aacttcagcc 


gggactggca 


gagcggccgg 


gccctgggcg 


ccctggtgga 


780 


cagctgtgcc 


ccgggcctgt 


gtcctgactg 


ggactcttgg 


gacgccagca 


agcccgttac 


840 


caatgcgcga 


gaggccatgc 


agcaggcgga 


tgactggctg 


ggcatccccc 


aggtgatcac 


900 


ccccgaggag 


attgtggacc 


ccaacgtgga 


cgagcactct 


gtcatgacct 


acctgtccca 


960 


gttccccaag 


gccaagctga 


agccaggggc 


tcccttgcgc 


cccaaactga 


acccgaagaa 


1020 


agcccgtgcc 


tacgggccag 


gcatcgagcc 


cacaggcaac 


atggtgaaga 


agcgggcaga 


1080 


gttcactgtg 


gagaccagaa 


gtgctggcca 


gggagaggtg 


ctggtgtacg 


tggaggaccc 


1140 
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ggccggacac caggaggagg caaaagtgac cgccaataac gacaagaacc gcaccttctc 1200 

cgtctggtac gtccccgagg tgacggggac tcataaggtt actgtgctct ttgctggcca 1260 

gcacatcgcc aagagcccct tcgaggtgta cgtggataag tcacagggtg acgccagcaa 1320 

agtgacagcc caaggtcccg gcctggagcc cagtggcaac atcgccaaca agaccaccta 1380 

ctttgagatc tttacggcag gagctggcac gggcgaggtc gaggttgtga tccaggaccc 1440 

catgggacag aagggcacgg tagagcctca gctggaggcc cggggcgaca gcacataccg 1500 

ctgcagctac cagcccacca tggagggcgt ccacaccgtg cacgtcacgt ttgccggcgt 1560 

gcccatccct cgcagcccct acactgtcac tgttggccaa gcctgtaacc cgagtgcctg 1620 

ccgggcggtt ggccggggcc tccagcccaa gggtgtgcgg gtgaaggaga cagctgactt 1680 

caaggtgtac acaaagggcg ctggcagtgg ggagctgaag gtcaccgtga agggccccaa 1740 

gggagaggag cgcgtgaagc agaaggacct gggggatggc gtgtatggct tcgagtatta 1800 

ccccatggtc cctggaacct atatcgtcac catcacgtgg ggtggtcaga acatcgggcg 1860 

cagtcccttc gaagtgaagg tgggcaccga gtgtggcaat cagaaggtac gggcctgggg 1920 

ccctgggctg gagggcggcg tcgttggcaa gtcagcagac tttgtggtgg aggctatcgg 1980 

ggacgacgtg ggcacgctgg gcttctcggt ggaagggcca tcgcaggcta agatcgaatg 2040 

tgacgacaag ggcgacggct cctgtgatgt gcgctactgg ccgcaggagg ctggcgagta 2100 

tgccgttcac gtgctgtgca acagcgaaga catccgcctc agccccttca tggctgacat 2160 

ccgtgacgcg ccccaggact tccacccaga cagggtgaag gcacgtgggc ctggattgga 2220 

gaagacaggt gtggccgtca acaagccagc agagttcaca gtggatgcca agcacggtgg 2280 

caaggcccca cttcgggtcc aagtccagga caatgaaggc tgccctgtgg aggcgttggt 2340 

caaggacaac ggcaatggca cttacagctg ctcctacgtg cccaggaagc cggtgaagca 2400 

cacagccatg gtgtcctggg gaggcgtcag catccccaac agccccttca gggtgaatgt 24 60 

gggagctggc agccacccca acaaggtcaa agtatacggc cccggagtag ccaagacagg 2520 

gctcaaggcc cacgagccca cctacttcac tgtggactgc gccgaggctg gccaggggga 2580 

cgtcagcatc ggcatcaagt gtgcccctgg agtggtaggc cccgccgaag ctgacatcga 2640 

cttcgacatc atccgcaatg acaatgacac cttcacggtc aagtacacgc cccggggggc 2700 

tggcagctac accattatgg tcctctttgc tgaccaggcc acgcccacca gccccatccg 27 60 

agtcaaggtg gagccctctc atgacgccag taaggtgaag gccgagggcc ctggcctcag 2820 

tcgcactggt gtcgagcttg gcaagcccac ccacttcaca gtaaatgcca aagctgctgg 2880 

caaaggcaag ctggacgtcc agttctcagg actcaccaag ggggatgcag tgcgagatgt 2940 

ggacatcatc gaccaccatg acaacaccta cacagtcaag tacacgcctg tccagcaggg 3000 

tccagtaggc gtcaatgtca cttatggagg ggatcccatc cctaagagcc ctttctcagt 3060 

ggcagtatct ccaagcctgg acctcagcaa gatcaaggtg tctggcctgg gagagaaggt 3120 

ggacgttggc aaagaccagg agttcacagt caaatcaaag ggtgctggtg gtcaaggcaa 3180 

agtggcatcc aagattgtgg gcccctcggg tgcagcggtg ccctgcaagg tggagccagg 3240 

cctgggggct gacaacagtg tggtgcgctt cctgccccgt gaggaagggc cctatgaggt 3300 

ggaggtgacc tatgacggcg tgcccgtgcc tggcagcccc tttcctctgg aagctgtggc 3360 

ccccaccaag cctagcaagg tgaaggcgtt tgggccgggg ctgcagggag gcagtgcggg 3420 

ctcccccgcc cgcttcacca tcgacaccaa gggcgccggc acaggtggcc tgggcctgac 3480 

ggtggagggc ccctgtgagg cgcagctcga gtgcttggac aatggggatg gcacatgttc 3540 

cgtgtcctac gtgcccaccg agcccgggga ctacaacatc aacatcctct tcgctgacac 3600 

ccacatccct ggctccccat tcaaggccca cgtggttccc tgctttgacg catccaaagt 3660 

caagtgctca ggccccgggc tggagcgggc caccgctggg gaggtgggcc aattccaagt 3720 

ggactgctcg agcgcgggca gcgcggagct gaccattgag atctgctcgg aggcggggct 3780 

tccggccgag gtgtacatcc aggaccacgg tgatggcacg cacaccatta cctacattcc 3840 

cctctgcccc ggggcctaca ccgtcaccat caagtacggc ggccagcccg tgcccaactt 3900 

ccccagcaag ctgcaggtgg aacctgcggt ggacacttcc ggtgtccagt gctatgggcc 3960 

tggtattgag ggccagggtg tcttccgtga ggccaccact gagttcagtg tggacgcccg 4020 

ggctctgaca cagaccggag ggccgcacgt caaggcccgt gtggccaacc cctcaggcaa 4 080 

cctgacggag acctacgttc aggaccgtgg cgatggcatg tacaaagtgg agtacacgcc 4140 

ttacgaggag ggactgcact ccgtggacgt gacctatgac ggcagtcccg tgcccagcag 4200 

ccccttccag gtgcccgtga ccgagggctg cgacccctcc cgggtgcgtg tccacgggcc 4260 

aggcatccaa agtggcacca ccaacaagcc caacaagttc actgtggaga ccaggggagc 4320 

tggcacgggc ggcctgggcc tggctgtaga gggcccctcc gaggccaaga tgtcctgcat 4 380 

ggataacaag gacggcagct gctcggtcga gtacatccct tatgaggctg gcacctacag 4440 

cctcaacgtc acctatggtg gccatcaagt gccaggcagt cctttcaagg tccctgtgca 4500 

tgatgtgaca gatgcgtcca aggtcaagtg ctctgggccc ggcctgagcc caggcatggt 4560 

tcgtgccaac ctccctcagt ccttccaggt ggacacaagc aaggctggtg tggccccatt 4 620 

gcaggtcaaa gtgcaagggc ccaaaggcct ggtggagcca gtggacgtgg tagacaacgc 4 680 

tgatggcacc cagaccgtca attatgtgcc cagccgagaa gggccctaca gcatctcagt 4 740 

actgtatgga gatgaagagg taccccggag ccccttcaag gtcaaggtgc tgcctactca 4800 
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tgatgccagc aaggtgaagg ccagtggccc cgggctcaac accactggcg tgcctgccag 48 60 

cctgcccgtg gagttcacca tcgatgcaaa ggacgccggg gagggcctgc tggctgtcca 4 920 

gatcacggat cccgaaggca agccgaagaa gacacacatc caagacaacc atgacggcac 4980 

gtatacagtg gcctacgtgc cagacgtgac aggtcgctac accatcctca tcaagtacgg 5040 

tggtgacgag atccccttct ccccgtaccg cgtgcgtgcc gtgoccaccg gggacgccag 5100 

caagtgcact gtcacagtgt caatcggagg tcacgggcta ggtgctggca tcggccccac 5160 

cattcagatt ggggaggaga cggtgatcac tgtggacact aaggcggcag gcaaaggcaa 5220 

agtgacgtgc accgtgtgca cgcctgatgg ctcagaggtg gatgtggacg tggtggagaa 5280 

tgaggacggc actttcgaca tcttctacac ggccccccag ccgggcaaat acgtcatctg 534 0 

tgtgcgcttt ggtggcgagc acgtgcccaa cagccccttc caagtgacgg ctctggctgg 5400 

ggaccagccc tcggtgcagc cccctctacg gtctcagcag ctggccccac agtacaccta 54 60 

cgcccagggc ggccagcaga cttgggcccc ggagaggccc ctggtgggtg tcaatgggct 5520 

ggatgtgacc agcctgaggc cctttgacct tgtcatcccc ttcaccatca agaagggcga 5580 

gatcacaggg gaggttcgga tgccctcagg caaggtggcg cagcccacca tcactgacaa 5640 

caaagacggc accgtgaccg tgcggtatgc acccagcgag gctggcctgc acgagatgga 5700 

catccgctat gacaacatgc acatcccagg aagccccttg cagttctatg tggattacgt 5760 

caactgtggc catgtcactg cctatgggcc tggcctcacc catggagtag tgaacaagcc 5820 

tgccaccttc accgtcaaca ccaaggatgc aggagagggg ggcctgtctc tggccattga 5880 

gggcccgtcc aaagcagaaa tcagctgcac tgacaaccag gatgggacat gcagcgtgtc 5940 

ctacctgcct gtgctgccgg gggactacag cattctagtc aagtacaatg aacagcacgt 6000 

cccaggcagc cccttcactg ctcgggtcac aggtgacgac tccatgcgta tgtcccacct 6060 

aaaggtcggc tctgctgccg acatccccat caacatctca gagacggatc tcagcctgct 6120 

gacggccact gtggtcccgc cctcgggccg ggaggagccc tgtttgctga agcggctgcg 6180 

taatggccac gtggggattt cattcgtgcc caaggagacg ggggagcacc tggtgcatgt 6240 

gaagaaaaat ggccagcacg tggccagcag ccccatcccg gtggtgatca gccagtcgga 6300 

aattggggat gccagtcgtg ttcgggtctc tggtcagggc cttcacgaag gccacacctt 6360 

tgagcctgca gagtttatca ttgatacccg cgatgcaggc tatggtgggc tcagcctgtc 6420 

cattgagggc cccagcaagg tggacatcaa cacagaggac ctggaggacg ggacgtgcag 6480 

ggtcacctac tgccccacag agccaggcaa ctacatcatc aacatcaagt ttgccgacca 6540 

gcacgtgcct ggcagcccct tctctgtgaa ggtgacaggc gagggccggg tgaaagagag 6600 

catcacccgc aggcgtcggg ctccttcagt ggccaacgtt ggtagtcatt gtgacctcag 6660 

cctgaaaatc cctgaaatta gcatccagga tatgacagcc caggtgacca gcccatcggg 6720 

caagacccat gaggccgaga tcgtggaagg ggagaaccac acctactgca tccgctttgt 6780 

tcccgctgag atgggcacac acacagtcag cgtcaagtac aagggccagc acgtgcctgg 6840 

gagccccttc cagttcaccg tggggcccct aggggaaggg ggagcccaca aggtccgagc 6900 

tgggggccct ggcctggaga gagctgaagc tggagtgcca gccgaattca gtatctggac 6960 

ccgggaagct ggtgctggag gcctggccat tgctgtcgag ggccccagca aggctgagat 7020 

ctcttttgag gaccgcaagg acggctcctg tggtgtggct tatgtggtcc aggagccagg 7080 

tgactacgaa gtctcagtca agttcaacga ggaacacatt cccgacagcc ccttcgtggt 7140 

gcctgtggct tctccgtctg gcgacgcccg ccgcctcact gtttctagcc ttcaggagtc 7200 

agggctaaag gtcaaccagc cagcctcttt tgcagtcagc ctgaacgggg ccaagggggc 7260 

gatcgatgcc aaggtgcaca gcccctcagg agccctggag gagtgctatg tcacagaaat 7320 

tgaccaagat aagtatgctg tgcgcttcat ccctcgggag aatggcgttt acctgattga 7380 

cgtcaagttc aacggtaccc acatccctgg aagccccttc aagatccgag ttggggagcc 7440 

tgggcatgga ggggacccag gcttggtgtc tgcttacgga gcaggtctgg aaggcggtgt 7500 

cacagggaac ccagctgagt tcgtcgtgaa cacgagcaat gcgggagctg gtgccctgtc 7560 

ggtgaccatt gacggcccct ccaaggtgaa gatggattgc caggagtgcc ctgagggcta 7 620 

ccgcgtcacc tataccccca tggcacctgg cagctacctc atctccatca agtacggcgg 7 680 

cccctaccac attgggggca gccccttcaa ggccaaagtc acaggccccc gtctcgtcag 7740 

caaccacagc ctccacgaga catcatcagt gtttgtagac tctctgacca aggccacctg 7800 

tgccccccag catggggccc cgggtcctgg gcctgctgac gccagcaagg tggtggccaa 7860 

gggcctgggg ctgagcaagg cctacgtagg ccagaagagc agcttcacag tagactgcag 7920 

caaagcaggc aacaacatgc tgctggtggg ggttcatggc ccaaggaccc cctgcgagga 7980 

gatcctggtg aagcacgtgg gcagccggct ctacagcgtg tcctacctgc tcaaggacaa 8040 

gggggagtac acactggtgg tcaaatgggg gcacgagcac atcccaggca gcccctaccg 8100 

cgttgtggtg ccctgagtct ggggcccgtg ccagccggca gcccccaagc ctgccccgct 8160 

acccaagcag ccccgccctc ttcccctcaa ccccggccca ggccgccctg gccgcccgcc 8220 

tgtcactgca gctgcccctg ccctgtgccg tgctgcgctc acctgcctcc ccagccagcc 8280 

gctgacctct cggctttcac ttgggcagag ggagccattt ggtggcgctg cttgtcttct 834 0 
ttggttctgg gaggggtgag ggatgggg 8368 



36 



WO 2006/084272 PCT/US2006/004280 



<210> 74 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_001456 
<400> 74 

tgacctctcg gctttcactt gggcagaggg agccatttgg tggcgctgct tgtcttcttt 60 

<210> 75 
<211> 1642 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_001548 
<400> 75 

ccagatctca gaggagcctg gctaagcaaa accctgcaga acggctgcct aatttacagc 60 

aaccatgagt acaaatggtg atgatcatca ggtcaaggat agtctggagc aattgagatg 120 

tcactttaca tgggagttat ccattgatga cgatgaaatg cctgatttag aaaacagagt 180 

cttggatcag attgaattcc tagacaccaa atacagtgtg ggaatacaca acctactagc 240 

ctatgtgaaa cacctgaaag gccagaatga ggaagccctg aagagcttaa aagaagctga 300 

aaacttaatg caggaagaac atgacaacca agcaaatgtg aggagtctgg tgacctgggg 360 

caactttgcc tggatgtatt accacatggg cagactggca gaagcccaga cttacctgga 420 

caaggtggag aacatttgca agaagctttc aaatcccttc cgctatagaa tggagtgtcc 480 

agaaatagac tgtgaggaag gatgggcctt gctgaagtgt ggaggaaaga attatgaacg 54 0 

ggccaaggcc tgctttgaaa aggtgcttga agtggaccct gaaaaccctg aatccagcgc 600 

tgggtatgcg atctctgcct atcgcctgga tggctttaaa ttagccacaa aaaatcacaa 660 

gccattttct ttgcttcccc taaggcaggc tgtccgctta aatccagaca atggatatat 720 

taaggttctc cttgccctga agcttcagga tgaaggacag gaagctgaag gagaaaagta 780 

cattgaagaa gctctagcca acatgtcctc acagacctat gtctttcgat atgcagccaa 840 

gttttaccga agaaaaggct ctgtggataa agctcttgag ttattaaaaa aggccttgca 900 

ggaaacaccc acttctgtct tactgcatca ccagataggg ctttgctaca aggcacaaat 960 

gatccaaatc aaggaggcta caaaagggca gcctagaggg cagaacagag aaaagctaga 1020 

caaaatgata agatcagcca tatttcattt tgaatctgca gtggaaaaaa agcccacatt 1080 

tgaggtggct catctagacc tggcaagaat gtatatagaa gcaggcaatc acagaaaagc 1140 

tgaagagaat tttcaaaaat tgttatgcat gaaaccagtg gtagaagaaa caatgcaaga 1200 

catacatttc tactatggtc ggtttcagga atttcaaaag aaatctgacg tcaatgcaat 12 60 

tatccattat ttaaaagcta taaaaataga acaggcatca ttaacaaggg ataaaagtat 1320 

caattctttg aagaaattgg ttttaaggaa acttcggaga aaggcattag atctggaaag 1380 

cttgagcctc cttgggttcg tctataaatt ggaaggaaat atgaatgaag ccctggagta 1440 

ctatgagcgg gccctgagac tggctgctga ctttgagaac tctgtgagac aaggtcctta 1500 

ggcacccaga tatcagccac tttcacattt catttcattt tatgctaaca tttactaatc 1560 

atcttttctg cttactgttt tcagaaacat tataattcac tgtaatgatg taattcttga 1620 

ataataaatc tgacaaaata tt 1642 

<210> 76 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_001548 
<400> 76 

gtatcaattc tttgaagaaa ttggttttaa ggaaacttcg gagaaaggca ttagatctgg 60 
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<210> 77 

<211> 3344 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001605 
<400> 77 

ggtacagctg cgcgtctgcg ggaataggtg 
aggagctggg gacggcgacc ctaggagagt 
ctctaacagc .aagtgaaatc cggcagcgat 
cgtatgttca ctcgtctgcc accatcccat 
caggcatgaa ccagtttaaa cccattttcc 
caaagctgag cagagctgcc aatacccaga 
acctggacga tgtgggcaag gatgtctatc 
ggtcttttgg agattacttt aaggaattgg 
aagagtttgg cattcccatt gaaagacttt 
ctggcttaga agcagatctg gaatgcaaac 
ccaaaatcct cccaggcaac atgaaggata 
gtggtccttg cagtgagatc cactacgacc 
tcaaccagga cgaccctaat gtgctggaga 
gggaagctga tggcattctg aaacctcttc 
tggaacgact ggtatctgtg ctgcagaata 
tcccttactt tgaagccatt cagaagggca 
gtgctgagga tgccgatggg attgacatgg 
ccatcactgt ggcactggct gatggtggcc 
tgagacggat tctccgccga gctgtccgat 
gcttctttgc tacgttagtg gatgttgtcg 
tgaagaagga cccagacatg gtgaaggaca 
agactctcag cagagggcgt cgcatcctgg 
agaccattcc cggagacact gcttggctcc 
tgactggact gattgctgaa gagaagggcc 
agaggaaact ggcccagctg aaatcacagg 
tgctggacat ttacgctatc gaagagctcc 
ccccaaagta caattaccat ttggactcca 
ctacggtgat ggctctgcgc agggagaaga 
agtgtggagt ggtgctggac aagacctgtt 
acgaaggcta cctggtgaag gtggatgaca 
agaatgctca ggtccgagga gggtatgtgc 
aagtggggga tcaggtctgg ctgtttattg 
accacacagc tacgcacatt ctgaacttcg 
agaaaggctc attggttgct cctgaccgcc 
tgtccaccca acagatcaag aaggctgaag 
aggccgtcta tacccaggat tgccccctgg 
ctgtgtttga tgagacctat cctgaccctg 
ccgagttgct ggatgacccc tctgggcctg 
ggggaacgca cctgcggaac tcgagtcatg 
ccattgccaa gggtatccgg aggattgtgg 
tcaggaaagc agagagcttg aagaaatgtc 
agactgctcc aaacaaggat gtgcagaggg 
ctgcagtcat cccccagtgg cagaaggatg 
aggtcatgga tgacttggac cgagccagca 

agacgaagca gttcatcgac agcaacccca 
gcggcgcctc agccaaggcc ctgaatgaag 
agacttctgc catgctcttc acggtggaca 
aagtccccca gaatgcagcc aatcggggct 
caggcttgat ggacggtaaa ggtggtggca 
acgttggctg cctgcaggag gcgctgcagc 
gggatgtaaa gaactgagtg gggaaggagg 




PCT/LS2006/004280 



cagcgggccc ttggcggggg actctgaggg 60 

tctttggggt gactttcaag atggactcta 120 

ttatagattt cttcaagagg aacgagcata 180 

tggatgaccc cactttgctc tttgccaatg 240 

tgaacacaat tgacccatct caccccatgg 300 

agtgcatccg ggctgggggc aaacaaaatg 360 

atcacacctt cttcgagatg ctgggctctt 420 

catgtaagat ggctctggaa ctcctcaccc 480 

atgttactta ctttggcggg gatgaagcag 540 

agatctggca aaatttgggg ctggatgaca 600 

acttctggga gatgggtgac acgggcccct 660 

ggattggtgg tcgggacgcc gcacatcttg 720 

tctggaacct tgtgttcatc cagtataaca 780 

ccaagaaaag cattgacaca gggatgggcc 840 

agatgtccaa ctatgacact gacctttttg 900 

caggtgcccg accatacact gggaaagttg 960 

cctaccgggt gctggctgac catgctcgga 1020 

ggcctgacaa cacagggcgt ggatatgtgt 1080 

acgcccatga aaagctcaat gccagcaggg 1140 

tccagtccct gggagatgca tttcctgagc 1200 

tcattaatga agaagaggtg cagtttctca 1260 

acaggaaaat tcagagcctg ggagacagca 1320 

tctatgacac ctatgggttt ccagtggatc 1380 

tggtggtaga catggatggc tttgaagagg 1440 

gcaagggagc tggtggggaa gacctcatta 1500 

gggcacgggg tctggaggtc acagatgatt 1560 

gtggtagcta tgtatttgag aacacagtgg 1620 

tgttcgtgga agaggtgtcc acaggccagg 1680 

tctatgctga gcaaggaggc cagatctatg 1740 

gcagtgaaga taaaacagag tttacagtga 1800 

tacacattgg aaccatctac ggtgacctga 1860 

atgagccccg acgaagaccc atcatgagca 1920 

ccctgcgctc agtgcttggg gaagctgacc 1980 

tcagatttga ctttactgcc aagggagcca 2040 

agattgctaa tgagatgatt gaggcagcca 2100 

cagcagcgaa agccatccag ggcctacggg 2160 

tgcgagtcgt ctccattggg gtcccggtgt 2220 

ctggctccct gacttctgtt gagttctgtg 2280 

caggagcttt tgtgatcgtg acggaagaag 2340 

ctgtcacagg tgccgaggcc cagaaggccc 2400 

tctctgtcat ggaagccaaa gtgaaggctc 24 60 

agatcgctga ccttggagag gccctggcca 2520 

aattgcggga gactctcaaa tccctaaaga 2580 

aagccgatgt ccagaaacga gtgttagaga 2640 

accagcctct tgtcatcctg gagatggaga 2700 

ccttgaagct cttcaagatg cactcccctc 27 60 

atgaggctgg caagatcacg tgcctgtgtc 2820 

taaaagccag cgagtgggtg cagcaggtgt 2880 

aggatgtgtc tgcacaggcc acaggcaaga 2940 

tggccacttc cttcgcccag ctgcgcctcg 3000 

aggctcccac tggatccatc cgtccagcca 3060 
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agagctcttc atctgctaca agaacatttg aatcttggga cctttaaaga gcccctccta 3120 

acccagcagt aactggaaca cacttgggag cagtcctatg tctcagtgcc ccttaaattt 3180 

ctgccctgag ccctccacgt cagtgccatc ggtctagaac cactaacccc gcattgctgt 3240 

tgatcgtcac gctcgcatct atagataacg gctctccaga cctgagcttt ccgcgtcagc 3300 

aagtaggaat cgtttttgct gcagagaata aaaggaccac gtgc 3344 

<210> 78 

<211> 60 

<212> DNA ■ 

<213> Homo sapiens 

<300> 

<308> NM_001605 
<400> 78 

gccaagagct cttcatctgc tacaagaaca tttgaatctt gggaccttta aagagcccct 60 



<210> 79 

<211> 417 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001645 
<400> 79 

acctcccaac caagccctcc agcaaggatt caggagtgcc cctcgggcct cgccatgagg 60 

ctcttcctgt cgctcccggt cctggtggtg gttctgtcga tcgtcttgga aggcccagcc 120 

ccagcccagg ggaccccaga cgtctccagt gccttggata agctgaagga gtttggaaac 180 

acactggagg acaaggctcg ggaactcatc agccgcatca aacagagtga actttctgcc 240 

aagatgcggg agtggttttc agagacattt cagaaagtga aggagaaact caagattgac 300 

tcatgaggac ctgaagggtg acatccagga ggggcctctg aaatttccca caccccagcg 360 
cctgtgctga ggactcccgc catgtggccc caggtgccac caataaaaat cctaccg 417 

<210> 80 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001645 
<400> 80 

aaacagagtg aactttctgc caagatgcgg gagtggtttt cagagacatt tcagaaagtg 60 

<210> 81 

<211> 1389 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001809 
<400> 81 

cgcggacttc tgccaagcac cggctcatgt gaggctcgcg gcacagcgtt ctctgggctc 60 

cccagaagcc agcctttcgc tcccggaccc ggcagcccga gcaggagccg tgggaccggg 120 

cgccagcacc ctctgcggcg tgtcatgggc ccgcgccgcc ggagccgaaa gcccgaggcc 180 

ccgaggaggc gcagcccgag cccgaccccg acccccggcc cctcccggcg gggcccctcc 240 

ttaggcgctt cctcccatca acacagtcgg cggagacaag gttggctaaa ggagatccga 300 

aagcttcaga agagcacaca cctcttgata aggaagctgc ccttcagccg cctggcaaga 360 
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gaaatatgtg 


ttaaattcac 


tcgtggtgtg 




ggcaagccca 


ggccctattg 




gccctacaag 


aggcagcaga 




9 


ttgaggaege 


ctatctcctc 






ggccgag 


....... 


ccaaaggatg 


tgcaactggc 


ceggaggate 




cggggcc g 


aggagggac 


cggctgagct 


cctgcaccca 


gtgtttctgt 


cagtctttcc 






99999g a tga 


taccggggac 


tctccagagc 


catgactaga 


tccaatggat 




tctgcgatgc 


tgtctggact 




gaac g a 


gtgtgtgttg 


ctttaaatat 


Ton 
720 


ttttcttttt 


gagaagg 


agaagactgc 


atgactttcc 


tctgtaacag 


aggtaatata 


780 






caaaggcctg 


aaaataattt 


tcagataaag 


agactccaag 




gttgacttta 


gtttgtgagt 


tact cat gtg 


actatttgag 


gattttgaaa 


acatcagatt 


900 


tgctgtggta 


tgggagaaaa 


ggttatgtac 


ttattatttt 


agctctttct 


gtaatattta 


960 


cattttttac 


catatgtaca 


tttgtacttt 


tattttacac 


ataagggaaa 


aaataagacc 


1020 


actttgagca 


gttgcctgga 


aggctgggca 


tttccatcat 


atagacctct 


gcccttcaga 


1080 


gtagcctcac 


cattagtggc 


agcatcatgt 


aactgagtgg 


actgtgcttg 


teaaeggatg 


1140 


tgtagctttt 


cagaaactta 


attggggatg 


aatagaaaac 


ctgtaagctt 


tgatgttctg 


1200 


gttacttcta 


gtaaattcct 


gtcaaaatca 


attcagaaat 


tctaacttgg 


agaatttaac 


1260 


attttactct 


tgtaaatcat 


agaagatgta 


tcataacagt 


tcagaatttt 


aaagtacatt 


1320 


ttcgatgctt 


ttatgggtat 


ttttgtagtt 


tctttgtaga 


gagataataa 


aaatcaaaat 


1380 


atttaatga 


1389 












<210> 82 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_O01809 












<400> 82 














ggggatgaat 


agaaaacctg 


taagctttga 


tgttctggtt 


acttctagta. 


aattcct tc 
cc g c 


60 


<210> 83 














<211> 2205 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_001909 












<400> 83 














gcgcacgccg 


gccgcgccca 


cgtgaccggt 


ccgggtgcaa 


acacgegggt 


cagctgatcc 


60 


ggcccaactg 


cggcgtcatc 


ccggctataa 


gcgcacggcc 


tcggcgaccc 


tctccgaccc 


120 


ggccgccgcc 


gccatgcagc 


cctccagcct 


tctgccgctc 


gccctctgcc 


tgctggctgc 


180 


acccgcctcc 


gcgctcgtca 


ggatcccgct 


gcacaagttc 


acgtccatcc 


gccggaccat 


240 


gtcggaggtt 


gggggctctg 


tggaggacct 


ga ttgccaaa 


ggccccgtct 


caaagtactc 


300 


ccaggcggtg 


ccagccgtga 


cc 9 a 9999 cc 


cattcccgag 


gtgetcaaga 


actacatgga 


360 


cgcccagtac 


tacggggaga 


ttggcatcgg 


gacgcccccc 


cagtgettea 


cagtegtett 


420 


cgacacgggc 


tcctccaacc 


tgtgggtccc 


ctccatccac 


tgcaaactgc 


tggacatege 


480 


ttgctggatc 


caccacaagt 


acaacagcga 


caagtccagc 


acctacgtga 


agaatggtac 


540 


ctcgtttgac 


atccactatg 


gc cgggcag 


cctctccggg 


tacctgagcc 


aggacactgt 


600 


gtcggtgccc 


tgccagtcag 


eg eg cage 


ctctgccctg 


ggcggtgtca 


aagtggagag 


660 


gcaggtcttt 


ggggaggcca 


ccaagcagcc 


aggcat cacc 


ttcatcgcag 


ccaagttcga 


720 


tggcatcctg 


ggcatggcct 


acccccgcat 


ctccgtcaac 


aacgtgctgc 






caacctgatg 


cagcagaagc 


tggtggacca 


gaacatcttc 


tccttctacc 


tgagcaggga 


840 


cccagatgcg 


cagcctgggg 


gtgagctgat 


gctgggtggc 


acagactcca 


agtattacaa 


900 


gggttctctg 


tcctacctga 


atgtcacccg 


caaggcctac 


tggcaggtcc 


acctggacca 


960 


ggtggaggtg 


gccagcgggc 


tgaccctgtg 


caaggagggc 


tgtgaggcca 


ttgtggacac 


1020 


aggcacttcc 


ctcatggtgg 


gcccggtgga 


tgaggtgcgc 


gagctgeaga 


aggecategg 


1080 


ggccgtgccg 


ctgattcagg 


gcgagtacat 


gatcccctgt 


gagaaggtgt 


ccaccctgcc 


1140 


cgcgatcaca 


ctgaagctgg 


gaggcaaagg 


ctacaagctg 


tccccagagg 


actacacgct 


1200 


caaggtgtcg 


caggccggga 


agaccctctg 


cctgagcggc 


ttcatgggca 


tggacatccc 


1260 
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gccacccagc gggccactct ggatcctggg cgacgtcttc atcggccgct actacactgt 1320 

gtttgaccgt gacaacaaca gggtgggctt cgccgaggct gcccgcctct agttcccaag 1380 

gcgtccgcgc gccagcacag aaacagagga gagtcccaga gcaggaggcc cctggcccag 14 4 0 

cggcccctcc cacacacacc cacacactcg cccgcccact gtcctgggcg ccctggaagc 1500 

cggcggccca agcccgactt gqtgttttgt tctgtggttt tcccctccct gggttcagaa 1560 

atgctgcctg cctgtctgtc tctccatctg tttggtgggg gtagagctga tccagagcac 1620 

agatctgttt cgtgcattgg aagaccccac ccaagcttgg cagccgagct cgtgtatcct 1680 

ggggctccct tcatctccag ggagtcccct ccccggccct accagcgccc gctgggctga 1740 

gcccctaccc cacaccaggc cgtcctcccg ggccctccct tggaaacctg ccctgcctga 1800 

gggcccctct gcccagcttg ggcccagctg ggctctgcca ccctacctgt tcagtgtccc 1860 

gggcccgttg aggatgaggc cgctagaggc ctgaggatga gctggaagga gtgagagggg 1920 

acaaaaccca ccttgttgga gcctgcaggg tggtgctggg actgagccag tcccaggggc 1980 

atgtattggc ctggaggtgg ggttgggatt gggggctggt gccagccttc ctctgcagct 2040 

gacctctgtt gtcctcccct tgggcggctg agagccccag ctgacatgga aatacagttg 2100 

ttggcctccg gcctcccctc aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2160 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa 2205 

<210> 84 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_001909 
<400> 84 

tctgtttggt gggggtagag ctgatccaga gcacagatct gtttcgtgca ttggaagacc 60 

<210> 85 
<211> 817 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_002038 



gaaccgttta 


ctcgctgctg 


tgcccatcta 


tcagcaggct 


ccgggctgaa 


gattgcttct 


60 


cttctctcct 


ccaaggtcta 


gtgacggagc 


ccgcgcgcgg 


cgccaccatg 


cggcagaagg 


120 


cggtatcgct 


tttcttgtgc 


tacctgctgc 


tcttcacttg 


cagtggggtg 


gaggcaggta 


180 


agaaaaagtg 


ctcggagagc 


tcggacagcg 


gctccgggtt 


ctggaaggcc 


ctgaccttca 


240 


tggccgtcgg 


aggaggactc 


gcagtcgccg ggctgcccgc 


gctgggcttc 


accggcgccg 


300 


gcatcgcggc 


caactcggtg 


gctgcctcgc 


tgatgagctg 


gtctgcgatc 


ctgaatgggg 


360 


gcggcgtgcc 


cgccgggggg 


ctagtggcca 


cgctgcagag 


cctcggggct 


ggtggcagca 


420 


gcgtcgtcat 


aggtaatatt 


ggtgccctga 


tgggctacgc 


cacccacaag 


tatctcgata 


480 


gtgaggagga 


tgaggagtag 


ccagcagctc 


ccagaacctc 


ttcttccttc 


ttggcctaac 


540 


tcttccagtt 


aggatctaga 


actttgcctt 


tttttttttt 


tttttttttt 


tttgagatgg 


600 


gttctcacta 


tattgtccag 


gctagagtgc 


agtggctatt 


cacagatgcg 


aacatagtac 


660 


actgcagcct 


ccaactccta 


gcctcaagtg 


atcctcctgt 


ctcaacctcc 


caagtaggat 


720 


tacaagcatg 


cgccgacgat 


gcccagaatc 


cagaactttg 


tctatcactc 


tccccaacaa 


780 


cctagatgtg 


aaaacagaat 


aaacttcacc 


cagaaaa 817 






<210> 86 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_002038 













<400> 85 
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<400> 86 

agctcccaga acctcttctt ccttcttggc ctaactcttc cagttaggat ctagaacttt 60 

<210> 87 

<211> 1283 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002046 
<400> 87 

ctctctgctc ctcctgttcg acagtcagcc gcatcttctt ttgcgtcgcc agccgagcca 60 

catcgctcag acaccatggg gaaggtgaag gtcggagtca acggatttgg tcgtattggg 120 

cgcctggtca ccagggctgc ttttaactct ggtaaagtgg atattgttgc catcaatgac 180 

cccttcattg acctcaacta catggtttac atgttccaat atgattccac ccatggcaaa 240 

ttccatggca ccgtcaaggc tgagaacggg aagcttgtca tcaatggaaa tcccatcacc 300 

atcttccagg agcgagatcc ctccaaaatc aagtggggcg atgctggcgc tgagtacgtc 360 

gtggagtcca ctggcgtctt caccaccatg gagaaggctg gggctcattt gcagggggga 420 

gccaaaaggg tcatcatctc tgccccctct gctgatgccc ccatgttcgt catgggtgtg 480 

aaccatgaga agtatgacaa cagcctcaag atcatcagca atgcctcctg caccaccaac 540 

tgcttagcac ccctggccaa ggtcatccat gacaactttg gtatcgtgga aggactcatg 600 

accacagtcc atgccatcac tgccacccag aagactgtgg atggcccctc cgggaaactg 660 

tggcgtgatg gccgcggggc tctccagaac atcatccctg cctctactgg cgctgccaag 720 

gctgtgggca aggtcatccc tgagctgaac gggaagctca ctggcatggc cttccgtgtc 780 

cccactgcca acgtgtcagt ggtggacctg acctgccgtc tagaaaaacc tgccaaatat 840 

gatgacatca agaaggtggt gaagcaggcg tcggagggcc ccctcaaggg catcctgggc 900 

tacactgagc accaggtggt ctcctctgac ttcaacagcg acacccactc ctccaccttt 960 

gacgctgggg ctggcattgc cctcaacgac cactttgtca agctcatttc ctggtatgac 1020 

aacgaatttg gctacagcaa cagggtggtg gacctcatgg cccacatggc ctccaaggag 1080 

taagacccct ggaccaccag ccccagcaag agcacaagag gaagagagag accctcactg 1140 

ctggggagtc cctgccacac tcagtocccc accacactga atctcccctc ctcacagttg 1200 

ccatgtagac cccttgaaga ggggaggggc ctagggagcc gcaccttgtc atgtaccatc 1260 
aataaagtac cctgtgctca acc 1283 

<210> 88 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_00204 6 
<400> 88 

ctcaacgacc actttgtcaa gctcatttcc tggtatgaca acgaatttgg ctacagcaac 60 

<210> 89 
<211> 1610 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_002061 
<400> 89 

ggcacgaggc tgcggccgca 
cccgccgccg cccagccgcc 
ccgcgcggcg ccggcagcct 
gtcctgtggg cggccgcctg 
ggcggcccgt gccatgggca 
caccctgcac ctgcagacgg 



gtagccggag ccggagccgc agccaccggt gccttccttt 60 

gtccggcctc cctcgggccc gagcgcagac caggctccag 120 

cgcgctccct ctcgggtctc tctcgggcct cgggcaccgc 180 

cctgcccgcc cgcccgcagc cccttgcctg ccggcccctg 240 

ccgacagccg cgcggccaag gcgctcctgg cgcgggcccg 300 

ggaacctgct gaactggggc cgcctgcgga agaagtgccc 360 
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gtccacgcac agcgaggagc ttcatgattg tatccaaaaa accttgaatg aatggagttc 420 

ccaaatcaac ccagatttgg tcagggagtt tccagatgtc ttggaatgca ctgtatctca 480 

tgcagtagaa aagataaatc ctgatgaaag agaagaaatg aaagtttctg caaaactgtt 540 

cattgtagaa tcaaactctt catcatcaac tagaagtgca gttgacatgg cctgttcagt 600 

ccttggagtt gcacagctgg attctgtgat cattgcttca cctcctattg aagatggagt 660 

taatctttcc ttggagcatt tacagcctta ctgggaggaa ttagaaaact tagttcagag 720 

caaaaagatt gttgccatag gtacctctga tctagacaaa acacagttgg aacagctgta 780 

tcagtgggca caggtaaaac caaatagtaa ccaagttaat cttgcctcct gctgtgtgat 840 

gccaccagat ttgactgcat ttgctaaaca atttgacata cagctgttga ctcacaatga 900 

tccaaaagaa ctgctttctg aagcaagttt ccaagaagct cttcaggaaa gcattcctga 960 

cattcaagcg cacgagtggg tgccgctgtg gctactgcgg tattcggtca ttgtgaaaag 1020 

tagaggaatt atcaaatcaa aaggctacat tttacaagct aaaagaaggg gttcttaact 1080 

gacttaggag cataacttac ctgtaatttc cttcaatatg agagaaaatt gagatgtgta 1140 

aaatctagtt actgcctgta aatggtgtca ttgaggcaga tattctttcg tcatatttga 1200 

cagtatgttg tctgtcaagt tttaaatact tatcttgcct ccatatcaat ccattctcat 1260 

gaacctctgt attgctttcc ttaaactatt gttttctaat tgaaattgtc tataaagaaa 1320 

atacttgcaa tatatttttc ctttattttt atgactaata taaatcaaga aaatttgttg 1380 

ttagatatat tttggcctag gtatcagggt aatgtatata catatttttt atttccaaaa 1440 

aaaattcatt aattgcttct taactcttat tataaccaag caatttaatt acaattgtta 1500 

aaactgaaat actggaagaa gatatttttc ctgtcattga tgagatatat cagagtaact 1560 
ggagtagctg ggatttacta gtagtgtaaa taaaattcac tcttcaatac 1610 

<210> 90 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002061 
<400> 90 

ctgacttagg agcataactt acctgtaatt tccttcaata tgagagaaaa ttgagatgtg 60 

<210> 91 

<211> 873 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002106 



cgcagtttga atcgcggtgc gacgaaggag taggtggtgg gatctcaccg tgggtccgat 60 

tagccttttc tctgccttgc ttgcttgagc ttcagcggaa ttcgaaatgg ctggcggtaa 120 

ggctggaaag gactccggaa aggccaagac aaaggcggtt tcccgctcgc agagagccgg 180 

cttgcagttc ccagtgggcc gtattcatcg acacctaaaa tctaggacga ccagtcatgg 24 0 

acgtgtgggc gcgactgccg ctgtgtacag cgcagccatc ctggagtacc tcaccgcaga 300 

ggtacttgaa ctggcaggaa atgcatcaaa agacttaaag gtaaagcgta ttacccctcg 360 

tcacttgcaa cttgctattc gtggagatga agaattggat tctctcatca aggctacaat 420 

tgctggtggt ggtgtcattc cacacatcca caaatctctg attgggaaga aaggacaaca 480 

gaagactgtc taaaggatgc ctggattcct tgttatctca ggactctaaa tactctaaca 540 

gctgtccagt gttggtgatt ccagtggact gtatctctgt gaaaaacaca attttgcctt 600 

tttgtaattc tatttgagca agttggaagt ttaattagct ttccaaccaa ccaaatttct 660 

gcattcgagt cttaaccata tttaagtgtt actgtggctt caaagaagct attgattctg 720 

aagtagtggg ttttgattga gttgactgtt tttaaaaaac tgtttggatt ttaattgtga 780 

tgcagaagtt a'tagtaacaa acatttggtt ttgtacagac attatttcca ctctggtgga 84 0 
taagttcaat aaaggtcata tcccaaacta aaa 873 



<400> 91 



<210> 92 
<211> 60 
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<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002106 
<400> 92 

cgagtcttaa ccatatttaa gtgttactgt ggcttcaaag aagctattga ttctgaagta 60 

<210> 93 

<211> 4204 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002205 
<400> 93 

caggacaggg aagagcgggc gctatgggga gccggacgcc agagtcccct ctccacgccg 60 

tgcagctgcg ctggggcccc cggcgccgac ccccgctcgt gccgctgctg ttgctgctcg 120 

tgccgccgcc acccagggtc gggggcttca acttagacgc ggaggcccca gcagtactct 180 

cggggccccc gggctccttc ttcggattct cagtggagtt ttaccggccg ggaacagacg 240 

gggtcagtgt gctggtggga gcacccaagg ctaataccag ccagccagga gtgctgcagg 300 

gtggtgctgt ctacctctgt ccttggggtg ccagccccac acagtgcacc cccattgaat 360 

ttgacagcaa aggctctcgg ctcctggagt cctcactgtc cagctcagag ggagaggagc 420 

ctgtggagta caagtccttg cagtggttcg gggcaacagt tcgagcccat ggctcctcca 480 

tcttggcatg cgctccactg tacagctggc gcacagagaa ggagccactg agcgaccccg 540 

tgggcacctg ctacctctcc acagataact tcacccgaat tctggagtat gcaccctgcc 600 

gctcagattt cagctgggca gcaggacagg gttactgcca aggaggcttc agtgccgagt 660 

tcaccaagac tggccgtgtg gttttaggtg gaccaggaag ctatttctgg caaggccaga 720 

tcctgtctgc cactcaggag cagattgcag aatcttatta ccccgagtac ctgatcaacc 780 

tggttcaggg gcagctgcag actcgccagg ccagttccat ctatgatgac agctacctag 84 0 

gatactctgt ggctgttggt gaattcagtg gtgatgacac agaagacttt gttgctggtg 900 

tgcccaaagg gaacctcact tacggctatg tcaccatcct taatggctca gacattcgat 960 

ccctctacaa cttctcaggg gaacagatgg cctcctactt tggctatgca gtggccgcca 1020 

cagacgtcaa tggggacggg ctggatgact tgctggtggg ggcacccctg ctcatggatc 1080 

ggacccctga cgggcggcct caggaggtgg gcagggtcta cgtctacctg cagcacccag 1140 

ccggcataga gcccacgccc acccttaccc tcactggcca tgatgagttt ggccgatttg 1200 

gcagctcctt gacccccctg ggggacctgg accaggatgg ctacaatgat gtggccatcg 12 60 

gggctccctt tggtggggag acccagcagg gagtagtgtt tgtatttcct gggggcccag 1320 

gagggctggg ctctaagcct tcccaggttc tgcagcccct gtgggcagcc agccacaccc 1380 

cagacttctt tggctctgcc cttcgaggag gccgagacct ggatggcaat ggatatcctg 1440 

atctgattgt ggggtccttt ggtgtggaca aggctgtggt atacaggggc cgccccatcg 1500 

tgtccgctag tgcctccctc accatcttcc ccgccatgtt caacccagag gagcggagct 15 60 

gcagcttaga ggggaaccct gtggcctgca tcaaccttag cttctgcctc aatgcttctg 1620 

gaaaacacgt tgctgactcc attggtttca cagtggaact tcagctggac tggcagaagc 1680 

agaagggagg ggtacggcgg gcactgttcc tggcctccag gcaggcaacc ctgacccaga 17 4 0 

ccctgctcat ccagaatggg gctcgagagg attgcagaga gatgaagatc tacctcagga 1800 

acgagtcaga atttcgagac aaactctcgc cgattcacat cgctctcaac ttctccttgg 1860 

acccccaagc cccagtggac agccacggcc tcaggccagc cctacattat cagagcaaga 1920 

gccggataga ggacaaggct cagatcttgc tggactgtgg agaagacaac atctgtgtgc 1980 

ctgacctgca gctggaagtg tttggggagc agaaccatgt gtacctgggt gacaagaatg 2040 

ccctgaacct cactttccat gcccagaatg tgggtgaggg tggcgcctat gaggctgagc 2100 

ttcgggtcac cgcccctcca gaggctgagt actcaggact cgtcagacac ccagggaact 2160 

tctccagcct gagctgtgac tactttgccg tgaaccagag ccgcctgctg gtgtgtgacc 2220 

tgggcaaccc catgaaggca ggagccagtc tgtggggtgg ccttcggttt acagtccctc 2280 

atctccggga cactaagaaa accatccagt ttgacttcca gatcctcagc aagaatctca 2340 

acaactcgca aagcgacgtg gtttcctttc ggctctccgt ggaggctcag gcccaggtca 2400 

ccctgaacgg tgtctccaag cctgaggcag tgctattccc agtaagcgac tggcatcccc 24 60 

gagaccagcc tcagaaggag gaggacctgg gacctgctgt ccaccatgtc tatgagctca 2520 
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tcaaccaagg ccccagctcc attagccagg gtgtgctgga actcagctgt ccccaggctc 2580 

tggaaggtca gcagctccta tatgtgacca gagttacggg actcaactgc accaccaatc 2640 

accccattaa cccaaagggc ctggagttgg atcccgaggg ttccctgcac caccagcaaa 2700 

aacgggaagc tccaagccgc agctctgctt cctcgggacc tcagatcctg aaatgcccgg 2760 

aggctgagtg tttcaggctg cgctgtgagc tcgggcccct gcaccaacaa gagagccaaa 2820 

gtctgcagtt gcatttccga gtctgggcca agactttctt gcagcgggag caccagccat 2880 

ttagcctgca gtgtgaggct gtgtacaaag ccctgaagat gccctaccga atcctgcctc 2940 

ggcagctgcc ccaaaaagag cgtcaggtgg ccacagctgt gcaatggacc aaggcagaag 3000 

gcagctatgg cgtcccactg tggatcatca tcctagccat cctgtttggc ctcctgctcc 3060 

taggtctact catctacatc ctctacaagc ttggattctt caaacgctcc ctcccatatg 3120 

gcaccgccat ggaaaaagct cagctcaagc ctccagccac ctctgatgcc tgagtcctcc 3180 

caatttcaga ctcccattcc tgaagaacca gtccccccac cctcattcta ctgaaaagga 3240 

ggggtctggg tacttcttga aggtgctgac ggccagggag aagctcctct ccccagccca 3300 

gagacatact tgaagggcca gagccagggg ggtgaggagc tggggatccc tcccccccat 3360 

gcactgtgaa ggacccttgt ttacacatac cctcttcatg gatgggggaa ctcagatcca 3420 

gggacagagg cccagcctcc ctgaagcctt tgcattttgg agagtttcct gaaacaactg 3480 

gaaagataac taggaaatcc attcacagtt ctttgggcca gacatgccac aaggacttcc 354 0 

tgtccagctc caacctgcaa agatctgtcc tcagccttgc cagagatcca aaagaagccc 3600 

ccagtaagaa cctggaactt ggggagttaa gacctggcag ctctggacag ccccaccctg 3660 

gtgggccaac aaagaacact aactatgcat ggtgccccag gaccagctca ggacagatgc 3720 

cacaaggata gatgctggcc cagggccaga gcccagctcc aaggggaatc agaactcaaa 3780 

tggggccaga tccagcctgg ggtctggagt tgatctggaa cccagactca gacattggca 3840 

ccaatccagg cagatccagg actatatttg ggcctgctcc agacctgatc ctggaggccc 3900 

agttcaccct gatttaggag aagccaggaa tttcccagga cctgaagggg ccatgatggc 3960 

aacagatctg gaacctcagc ctggccagac acaggccctc cctgttcccc agagaaaggg 4020 

gagcccactg tcctgggcct gcagaatttg ggttctgcct gccagctgca ctgatgctgc 4080 

ccctcatctc tctgcccaac ccttccctca ccttggcacc agacacccag gacttattta 414 0 

aactctgttg caagtgcaat aaatctgacc cagtgccccc actgaccaga actagaaaaa 4200 
aaaa 4204 

<210> 94 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_002205 
<400> 94 

ttggcaccag acacccagga cttatttaaa ctctgttgca agtgcaataa atctgaccca 60 

<210> 95 
<211> 1976 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NMJD02266 
<400> 95 

gccacacggt ctttgagctg agtcgaggtg gaccctttga acgcagtcgc cctacagccg 60 

ctgattcccc ccgcatcgcc tcccgtggaa gcccaggccc gcttcgcagc tttctccctt 120 

tgtctcataa ccatgtccac caacgagaat gctaatacac cagctgcccg tcttcacaga 180 

ttcaagaaca agggaaaaga cagtacagaa atgaggcgtc gcagaataga ggtcaatgtg 240 

gagctgagga aagctaagaa ggatgaccag atgctgaaga ggagaaatgt aagctcattt 300 

cctgatgatg ctacttctcc gctgcaggaa aaccgcaaca accagggcac tgtaaattgg 360 

tctgttgatg acattgtcaa aggcataaat agcagcaatg tggaaaatca gctccaagct 420 

actcaagctg ccaggaaact actttccaga gaaaaacagc cccccataga caacataatc 480 

cgggctggtt tgattccgaa atttgtgtcc ttcttgggca gaactgattg tagtcccatt 54 0 

cagtttgaat ctgcttgggc actcactaac attgcttctg ggacatcaga acaaaccaag 600 

gctgtggtag atggaggtgc catcccagca ttcatttctc tgttggcatc tccccatgct 660 
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cacatcagtg aacaagctgt ctgggctcta ggaaacattg caggtgatgg ctcagt'gttc 720 

cgagacttgg ttattaagta cggtgcagtt gacccactgt tggctctcct tgcagttcct 780 

gatatgtcat ctttagcatg tggctactta cgtaatctta cctggacact ttctaatctt 84 0 

tgccgcaaca agaatcctgc acccccgata gatgctgttg agcagattct tcctacctta 900 

gttcggctcc tgcatcatga tgatccagaa gtgttagcag atacctgctg ggctatttcc 960 

taccttactg atggtccaaa • tgaacgaatt ggcatggtgg tgaaaacagg agttgtgccc 1020 

caacttgtga agcttctagg agcttctgaa ttgccaattg tgactcctgc cctaagagcc 1080 

atagggaata ttgtcactgg tacagatgaa cagactcagg ttgtgattga tgcaggagca 114 0 

ctcgccgtct ttcccagcct gctcaccaac cccaaaacta acattcagaa ggaagctacg 1200 

tggacaatgt caaacatcac agccggccgc caggaccaga tacagcaagt tgtgaatcat 1260 

ggattagtcc cattccttgt cagtgttctc tctaaggcag attttaagac acaaaaggaa 1320 

gctgtgtggg ccgtgaccaa ctataccagt ggtggaacag ttgaacagat tgtgtacctt 1380 

gttcactgtg gcataataga accgttgatg aacctcttaa ctgcaaaaga taccaagatt 1440 

attctggtta tcctggatgc catttcaaat atctttcagg ctgctgagaa actaggtgaa 1500 

actgagaaac ttagtataat gattgaagaa tgtggaggct tagacaaaat tgaagctcta 1560 

caaaaccatg aaaatgagtc tgtgtataag gcttcgttaa gcttaattga gaagtatttc 1620 

tctgtagagg aagaggaaga tcaaaacgtt gtaccagaaa ctacctctga aggctacact 1680 

ttccaagttc aggatggggc tcctgggacc tttaactttt agatcatgta gctgagacat 17 40 

aaatttgttg tgtactacgt ttggtatttt gtcttattgt ttctctacta agaactcttt 1800 

cttaaatgtg gtttgttact gtagcacttt ttacactgaa actatacttg aacagttcca 1860 

actgtacata catactgtat gaagcttgtc ctctgactag gtttctaatt tctatgtgga 1920 

atttcctatc ttgcagcatc ctgtaaataa acattcaagt ccacccttaa aaaaaa 1976 

<210> 96 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_002266 
<400> 96 

tgagtctgtg tataaggctt cgttaagctt aattgagaag tatttctctg tagaggaaga 60 

<210> 97 
<211> 1145 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_00234 6 



gctccggcca, gccgcggtcc agagcgcgcg aggttcgggg agctccgcca ggctgctggt 60 

acctgcgtcc gcccggcgag caggacaggc tgctttggtt tgtgacctcc aggcaggacg 120 

gccatcctct ccagaatgaa gatcttcttg ccagtgctgc tggctgccct tctgggtgtg 180 

gagcgagcca gctcgctgat gtgcttctcc tgcttgaacc agaagagcaa tctgtactgc 240 

ctgaagccga ccatctgctc cgaccaggac aactactgcg tgactgtgtc tgctagtgcc 300 

ggcattggga atctcgtgac atttggccac agcctgagca agacctgttc cccggcctgc 3 60 

cccatcccag aaggcgtcaa tgttggtgtg gcttccatgg gcatcagctg ctgccagagc 420 

tttctgtgca atttcagtgc ggccgatggc gggctgcggg caagcgtcac cctgctgggt 4 80 

gccgggctgc tgctgagcct gctgccggcc ctgctgcggt ttggcccctg accgcccaga 540 

ccctgtcccc cgatccccca gctcaggaag gaaagcccag ccctttctgg atcccacagt 600 

gtatgggagc ccctgactcc tcacgtgcct gatctgtgcc cttggtccca ggtcaggccc 660 

accccctgca cctccacctg ccccagcccc tgcctctgcc caagtgggcc agctgccctc 720 

acttctgggg tggatgatgt gaccttcctt gggggactgc ggaagggacg agggttccct 7 80 

ggagtcttac ggtccaacat cagaccaagt cccatggaca tgctgacagg gtccccaggg 840 

agaccgtgtc agtagggatg tgtgcctggc tgtgtacgtg ggtgtgcagt gcacgtgaga 900 

gcacgtggcg gcttctgggg gccatgtttg gggagggagg tgtgccagca gcctggagag 960 

cctcagtccc tgtagccccc tgccctggca cagctgcatg cacttcaagg gcagcctttg 1020 

ggggttgggg tttctgccac ttccgggtct aggccctgcc caaatccagc cagtcctgcc 1080 



<400> 97 
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<210> 98 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002346 
<400> 98 

ggttccctgg agtcttacgg tccaacatca gaccaagtcc catggacatg ctgacagggt 

<210> 99 
<211> 1390 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_002358 

<400> 99 
gggaagtgct 
ttgtgtccct 
gcgccgaaat 
gcatatatcc 
ctactgatct 
tatacaagtg 
tcctggaaag 
ccagagaaaa 
cagctacggt 
atacagacaa 
ccaattctga 
tggtggccta 
taattttgaa 
ttggttaatt 
cctttatttt 
cattgttcaa 
gatagtaact 
gttttggtca 
aaaggaagtc 
aacaatgaaa 
ttgaatcagt 
atatttgtac 
ttataaaatc 
aaaaaaaaaa 



gttggagccg 
ggccatggcg 
cgtggccgag 
atctgaaacc 
tgagctcata 
ttcagttcag 
atggcagttt 
gtctcagaaa 
gacatttctg 
agatttggtt 
ggaagtccgc 
caaaattcct 
atgtggtttt 
tttacatgga 
tttggtacct 
aaggaaccag 
gtagatggaa 
agtagtttga 
taaatat.tca 
tattgctgta 
ttccaattat 
tgtttaatgt 
aagttttaag 
1390 



ctgtggttgc 
ctgcagctct 
ttcttctcat 
tttactcgag 
aaatacctaa 
aaactggttg 
■gatattgagt 
gctatccagg 
ccactgttgg 
gtacctgaaa 
cttcgttcat 
gtcaatgact 
cctgaaatca 
gaaaaccaaa 
atttgactta 
gaggtttttt 
aaacttgtgc 
ctcagtatag 
gaatctttgt 
tagctccttt 
ttgactttaa 
tctgtgatac 
tgaaagtgag 



tgtccgcgga 
cccgggagca 
tcggcatcaa 
tgcagaaata 
ataatgtggt 
tagttatctc 
gtgacaagac 
atgaaatccg 
aagtttcttg 
aatgggaaga 
ttactactac 
gaggatgaca 
ggtcatctat 
atgatactta 
ccatggagtt 
tgtcaacatt 
tataaagcta 
gtagggagat 
taaggtcctg 
tgaccttcat 
tttatgtaac 
agaactctta 
gaaataaagt 



gtggaagcgc 
gggaatcacc 
cagcatttta 
cggactcacc 
ggaacaactg 
aaatattgaa 
tgcaaaagat 
ttcagtgatc 
ttcatttgat 
gtcgggacca 
aatccacaaa 
tgaggaaaat 
agttgatatg 
ctgaactgtg 
aacatcatga 
gtgatgtata 
gatgctttcc 
atttaagtat 
aaagtaactc 
ttcatgtata 
ttgaacctat 
aaaatgtttt 
taagtttgtt 



gtgcttttgt 
ctgcgcggga 
tatcagcgtg 
ttgcttgtaa 
aaagattggt 
agtggtgagg 
gacagtgcac 
agacagatca 
ctgctgattt 
cagtttatta 
gtaaatagca 
aatgtaattg 
ttttatttca 
tgtaattgtt 
atttattgca 
ttcctttgaa 
taaatcagat 
aaaatacaac 
ataatctata 
gttttcccta 
gaagcaatgg 
ttcatgtgtt 
ttaaaaaaaa 



<210> 100 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM 002358 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



<400> 100 

atgctttcct aaatcagatg ttttggtcaa gtagtttgac tcagtatagg tagggagata 

<210> 101 
<211> 1821 
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<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_002422 
<400> 101 

acaaggaggc aggcaagaca gcaaggcata gagacaacat agagctaagt aaagccagtg 
gaaatgaaga gtcttccaat cctactgttg ctgtgcgtgg cagtttgctc agcctatcca 
ttggatggag ctgcaagggg tgaggacacc agcatgaacc ttgttcagaa atatctagaa 
aactactacg acctcaaaaa agatgtgaaa cagtttgtta ggagaaagga cagtggtcct 
gttgttaaaa aaatccgaga aatgcagaag ttccttggat tggaggtgac ggggaagctg 
gactccgaca ctctggaggt gatgcgcaag cccaggtgtg gagttcctga tgttggtcac 
ttcagaacct ttcctggcat cccgaagtgg aggaaaaccc accttacata caggattgtg 
aattatacac cagatttgcc aaaagatgct gttgattctg ctgttgagaa agctctgaaa 
gtctgggaag aggtgactcc actcacattc tccaggctgt atgaaggaga ggctgatata 
atgatctctt ttgcagttag agaacatgga gacttttacc cttttgatgg acctggaaat 
gttttggccc atgcctatgc ccctgggcca gggattaatg gagatgccca ctttgatgat 
gatgaacaat ggacaaagga tacaacaggg accaatttat ttctcgttgc tgctcatgaa 
attggccact ccctgggtct ctttcactca gccaacactg aagctttgat gtacccactc 
tatcactcac tcacagacct gactcggttc cgcctgtctc aagatgatat aaatggcatt 
cagtccctct atggacctcc ccctgactcc cctgagaccc ccctggtacc cacggaacct 
gtccctccag aacctgggac gccagccaac tgtgatcctg ctttgtcctt tgatgctgtc 
agcactctga ggggagaaat cctgatcttt aaagacaggc acttttggcg caaatccctc 
aggaagcttg aacctgaatt gcatttgatc tcttcatttt ggccatctct tccttcaggc 
gtggatgccg catatgaagt tactagcaag gacctcgttt tcatttttaa aggaaatcaa 
ttctgggcca tcagaggaaa tgaggtacga gctggatacc caagaggcat ccacacccta 
ggtttccctc caaccgtgag gaaaatcgat gcagccattt ctgataagga aaagaacaaa 
acatatttct ttgtagagga caaatactgg agatttgatg agaagagaaa ttccatggag 
ccaggctttc ccaagcaaat agctgaagac tttccaggga ttgactcaaa gattgatgct 
gtttttgaag aatttgggtt cttttatttc tttactggat cttcacagtt ggagtttgac 
ccaaatgcaa agaaagtgac acacactttg aagagtaaca gctggcttaa ttgttgaaag 
agatatgtag aaggcacaat atgggcactt taaatgaagc taataattct tcacctaagt 
ctctgtgaat tgaaatgttc gttttctcct gcctgtgctg tgactcgagt cacactcaag 
ggaacttgag cgtgaatctg tatcttgccg gtcattttta tgttattaca gggcattcaa 
atgggctgct gcttagcttg caccttgtca catagagtga tctttcccaa gagaagggga 
agcactcgtg tgcaacagac aagtgactgt atctgtgtag actatttgct tatttaataa 
agacgatttg tcagttgttt t 1821 

<210> 102 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_002422 
<400> 102 

tgtagaaggc acaatatggg cactttaaat gaagctaata attcttcacc taagtctctg 60 

<210> 103 
<211> 2787 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_0024 62 
<400> 103 

agagcggagg ccgcactcca gcactgcgca gggaccgcct tggaccgcag ttgccggcca 60 
ggaatcccag tgtcacggtg gacacgcctc cctcgcgccc ttgccgccca cctgctcacc 120 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 
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cagctcaggg gctttggaat tctgtggcca cactgcgagg agatcggttc tgggtcggag 180 

gctacaggaa gactcccact ccctgaaatc tggagtgaag aacgccgcca tccagccacc 240 

attccaagga ggtgcaggag aacagctctg tgataccatt taacttgttg acattacttt 300 

tatttgaagg aacgtatatt agagcttact ttgcaaagaa ggaagatggt tgtttccgaa 360 

gtggacatcg caaaagctga tccagctgct gcatcccacc ctctattact gaatggagat 420 

gctactgtgg cccagaaaaa tccaggctcg gtggctgaga acaacctgtg cagccagtat 480 

gaggagaagg tgcgcccctg catcgacctc attgactccc tgcgggctct aggtgtggag 540 

caggacctgg ccctgccagc catcgccgtc atcggggacc agagctcggg caagagctcc 600 

gtgttggagg cactgtcagg agttgccctt cccagaggca gcgggatcgt gaccagatgc 660 

ccgctggtgc tgaaactgaa gaaacttgtg aacgaagata agtggagagg caaggtcagt 720 

taccaggact acgagattga gatttcggat gcttcagagg tagaaaagga aattaataaa 780 

gcccagaatg ccatcgccgg ggaaggaatg ggaatcagtc atgagctaat caccctggag 840 

atcagctccc gagatgtccc ggatctgact ctaatagacc ttcctggcat aaccagagtg 900 

gctgtgggca atcagcctgc tgacattggg tataagatca agacactcat caagaagtac 960 

atccagaggc aggagacaat cagcctggtg gtggtcccca gtaatgtgga catogccacc 1020 

acagaggctc tcagcatggc ccaggaggtg gaccccgagg gagacaggac catcggaatc 1080 

ttgacgaagc ctgatctggt ggacaaagga actgaagaca aggttgtgga cgtggtgcgg 1140 

aacctcgtgt tccacctgaa gaagggttac atgattgtca agtgccgggg ccagcaggag 1200 

atccaggacc agctgagcct gtccgaagcc ctgcagagag agaagatctt ctttgagaac 1260 

cacccatatt tcagggatct gctggaggaa ggaaaggcca cggttccctg cctggcagaa 1320 

aaacttacca gcgagctcat cacacatatc tgtaaatctc tgcccctgtt agaaaatcaa 1380 

atcaaggaga ctcaccagag aataacagag gagctacaaa agtatggtgt cgacataccg 1440 

gaagacgaaa atgaaaaaat gttcttcctg atagataaaa ttaatgcctt taatcaggac 1500 

atcactgctc tcatgcaagg agaggaaact gtaggggagg aagacattcg gctgtttacc 1560 

agactccgac acgagttcca caaatggagt acaataattg aaaacaattt tcaagaaggc 1620 

cataaaattt tgagtagaaa aatccagaaa tttgaaaatc agtatcgtgg tagagagctg 1680 

ccaggctttg tgaattacag gacatttgag acaatcgtga aacagcaaat caaggcactg 1740 

gaagagccgg ctgtggatat gctacacacc gtgacggata tggtccggct tgctttcaca 1800 

gatgtttcga taaaaaattt tgaagagttt tttaacctcc acagaaccgc caagtcoaaa 1860 

attgaagaca ttagagcaga acaagagaga gaaggtgaga agctgatccg cctccacttc 1920 

cagatggaac agattgtcta ctgccaggac caggtataca ggggtgcatt gcagaaggtc 1980 

agagagaagg agctggaaga agaaaagaag aagaaatcct gggattttgg ggctttccag 2040 

tccagctcgg caacagactc ttccatggag gagatctttc agcacctgat ggcctatcac 2100 

caggaggcca gcaagcgcat ctccagccac atccctttga tcatccagtt cttcatgctc 2160 

cagacgtacg gccagcagct tcagaaggcc atgctgcagc tcctgcagga caaggacacc 2220 

tacagctggc tcctgaagga gcggagcgac accagcgaca agcggaagtt cctgaaggag 2280 

cggcttgcac ggctgacgca ggctcggcgc cggcttgccc agttccccgg ttaaccacac 2340 

tctgtccagc cccgtagacg tgcacgcaca ctgtctgccc ccgttcccgg gtagccactg 2400 

gactgacgac ttgagtgctc agtagtcaga ctggatagtc cgtctctgct tatccgttag 2460 

ccgtggtgat ttagcaggaa gctgtgagag cagtttggtt tctagcatga agacagagcc 2520 

ccaccctcag atgcacatga gctggcggga ttgaaggatg ctgtcttcgt actgggaaag 2580 

ggattttcag ccctcagaat cgctccacct tgcagctctc cccttctctg tattcctaga 2640 

aactgacaca tgctgaacat cacagcttat ttcctcattt ttataatgtc ccttcacaaa 2700 

cccagtgttt taggagcatg agtgccgtgt gtgtgcgtcc tgtcggagcc ctgtctcctc 2760 
tctctgtaat aaactcattt ctagcag 2787 

<210> 104 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_002462 



<400> 104 

actgacacat gctgaacatc acagcttatt tcctcatttt tataatgtcc cttcacaaac 

<210> 105 
<211> 2808 
<212> DNA 
<213> Homo sapiens 
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<300> 

<308> NM 002759 



<400> 105 

gcggcggcgg 

tccacacttg 

gaggcatcga 

ttcgctggta 

ctcttttacc 

acctcaaaac 

aatcaacatc 

atacgggaag 

acataccgtc 

ccacatgata 

gaaggtagat 

aataaggaaa 

ttatccatgg 

gtaaattatg 

aaaatgggac 

ttggccgcta 

tacctgtcct 

agcacactcg 

aattctaaca 

caaaggaagg 

aagtatactg 

ggtggatttg 

aaacgtgtta 

gatcatgtaa 

accagtgatg 

aggtcaaaga 

caatggattg 

gaacaaataa 

aagccaagta 

gtaacatctc 

agcccagaac 

ctaattcttg 

acagacctac 

cagaaattac 

ttgactgtgt 

gaaaaagtat 

atatcaatag 

aatctttctg 

tactttttcc 

tgttgcccag 

gttcaagtga 

acccaactaa 

ctcaaactcc 

gggatgagcc 

aattttcttt 

cgcttttagg 

ggagaaatat 



cggcgcagtt 
gtagaaccac 
ggtccatccc 
tactcgctgt 
agtttctgga 
tttagcagtt 
cacacttccg 
aagaaatggc 
agaagcaggg 
ggaggtttac 
caaagaagga 
agaaggcagt 
ggaattacat 
aacagtgtgc 
agaaagaata 
aacttgcata 
ctggttcttt 
cttctgaatc 
gtgacagttt 
caaaaagatc 
tggacaagag 
gccaagtttt 
aatataataa 
atattgttca 
attctcttga 
ctaagtgcct 
aaaaaagaag 
caaaaggggt 
atatattctt 
tgaaaaatga 
agatttcttc 
ctgaacttct 
gggatggcat 
tctcaaagaa 
ggaagaaaag 
cctgcttctg 
atatttacct 
cagaaacaga 
tggctcatct 
gctggagtgc 
ttctcctgcc 
tttttgtgtt 
tgacctcaag 
accgcgccca 
ttatactatt 
ccaaaaaaat 
ggtactcatt 



tgctcatact 
aggcacgaca 
aataaaaatc 
ctgtcaacca 
gcaaattcag 
cttccatctg 
tgattatctg 
tggtgatctt 
agtagtactt 
atttcaagtt 
agcaaaaaat 
tagtccttta 
aggccttatc 
atcgggggtg 
tagtattggt 
tcttcagata 
tgctactacg 
atcatctgaa 
aaacagttct 
tttggcaccc 
gtttggcatg 
caaagcaaaa 
cgagaaggcg 
ctacaatggc 
gagcagtgat 
tttcatccaa 
aggcgagaaa 
ggattatata 
agtagataca 
tggaaagcga 
gcaagactat 
tcatgtatgt 
catctcagat 
acctgaggat 
cccagagaaa 
atatgcagtt 
tttattttaa 
aaggttttct 
ctttattctt 
aatgacacag 
tcagcctcct 
tttaataaag 
taatccacct 
gcctcatctc 
aatgaatcaa 
gtaagatcgt 
aaaaaaaaaa 



ttgtgacttg 
agoatagaaa 
aggagaccct 
gcggttgact 
tttgccttcc 
actcaggttt 
cgtgcatttt 
tcagcaggtt 
aaatatcaag 
ataatagatg 
gccgcagcca 
ttattgacaa 
aatagaattg 
catgggccag 
acaggttcta 
ttatcagaag 
tgtgagtccc 
ggtgacttct 
tcgttgctta 
agatttgacc 
gattttaaag 
cacagaattg 
gagcgtgaag 
tgttgggatg 
tatgatcctg 
atggaattct 
ctagacaaag 
cattcaaaaa 
aaacaagtaa 
acaaggagta 
ggaaaggaag 
gacactgctt 
atatttgata 
cgacctaaca 
aatgaacgac 
ttccttaaat 
tgtttccttt 
tctttttgct 
tttttttttt 
tcttggctca 
gagtagctgg 
acagggtttc 
gcctcggcct 
tttgttctaa 
tcaattcata 
tctctgcctc 
aaaaagtgat 



cggtcacagt ggcattcagc 
catcctaaac aatcttcatc 
ggctatcata gaccttagtc 
ttttttaagc cttctttttt 
tggatttgta aattgtaatg 
gcttctctgg cggtcttcag 
ggacaaagct tccaaccagg 
tcttcatgga ggaacttaat 
aactgcctaa ttcaggacct 
gaagagaatt tccagaaggt 
aattagctgt tgagatactt 
caacgaattc ttcagaagga 
cccagaagaa aagactaact 
aaggatttca ttataaatgc 
ctaaacagga agcaaaacaa 
aaacctcagt gaaatctgac 
aaagcaactc tttagtgacc 
cagcagatac atcagagata 
tgaatggtct cagaaataat 
ttcctgacat gaaagaaaca 
aaatagaatt aattggctca 
acggaaagac ttacgttatt 
taaaagcatt ggcaaaactt 
gatttgatta tgatcctgag 
agaacagcaa aaatagttca 
gtgataaagg gaccttggaa 
ttttggcttt ggaactcttt 
aattaattca tagagatctt 
agattggaga ctttggactt 
agggaacttt gcgatacatg 
tggacctcta cgctttgggg 
ttgaaacatc aaagtttttc 
aaaaagaaaa aactcttcta 
catctgaaat actaaggacc 
acacatgtta gagcccttct 
tatctaaaat ctgctaggga 
aattttttac tatttttact 
tcaaaaacat tcttacattt 
ttaaagacag agtctcgctc 
ctgcaacttc tgcctcttgg 
attacaggca tgtgccaccc 
accatgttgg ccaggctggt 
cccaaagtgc tgggattaca 
agatggaaaa accaccccca 
tctatttatt aaatttctac 
acatagctta caagccagct 
gtacaacc 2808 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340. 

2400 

2460 

2520 

2580 

2640 

2700 

2760 



<210> 106 
<211> 60 



<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_002759 
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<400> 106 

tcgttctctg cctcacatag cttacaagcc agctggagaa atatggtact cattaaaaaa 

<210> 107 

<211> 1678 

<212> DNA 

<213> Homo sapiens 



<400> 107 

aagaaggagg ccgcgcgagg gctgacgaac cggaagaaga ggaactgggc ctgaaagggt 60 

accggtgacc gctactgctg ccggtgtttg cgtgtggcag ggagccaggc ctggcgagcg 120 

gggtgtgtcg cgatgccgga gctggcagtg cagaaggtgg tggtccaccc cctggtgctg 180 

ctcagtgtgg tggatcattt caaccgaatc ggcaaggttg gaaaccagaa gcgtgttgtt 240 

ggtgtgcttt tggggtcatg gcaaaagaaa gtacttgatg tatcgaacag ttttgcagtt 300 

ccttttgatg aagatgacaa agacgattct gtatggtttt tagaccatga ttatttggaa 360 

aacatgtatg gaatgtttaa gaaagtcaat gccagggaaa gaatagttgg ctggtaccac 420 

acaggcccta aactacacaa gaatgacatt gccatcaacg aactcatgaa aagatactgt 480 

cctaattccg tattggtcat cattgatgtg aagccgaagg acctagggct gcctacagaa 54 0 

gcgtacattt cagtggaaga agtccatgat gatggaactc caacctcgaa aacatttgaa 600 

cacgtgacca gtgaaattgg agcagaggaa gctgaggaag ttggagttga acacttgtta 660 

cgagatatca aagacacgac ggtgggcact ctgtcccagc ggatcacaaa ccaggtccat 720 

ggtttgaagg gactgaactc caagcttctg gatatcagga gctacctgga aaaagtcgcc 780 

acaggcaagc tgcccatcaa ccaccagatc atctaccagc tgcaggacgt cttcaacctg 840 

ctgccagatg tcagcctgca ggagttcgtc aaggcctttt acctgaagac caatgaccag 900 

atggtggtag tgtacttggc ctcgctgatc cgttccgtgg tcgccctgca caacctcatc 960 

aacaacaaga ttgccaaccg ggatgcagag aagaaagaag ggcaggagaa agaagagagc 1020 

aaaaaggata ggaaagagga caaggagaaa gataaagata aggaaaagag tgatgtaaag 1080 

aaagaggaga aaaaggagaa aaagtaaaac atgtattaaa tagctttttt aatttgtaaa 1140 

ttaaaatctt acaaactaaa tcagtgtgct gctagagggt tctttttcac ttgacatgct 1200 

tattagaaag ctgacccaac aagagctctc tgcctccggt cactcttgct gtggtgctac 1260 

gtggaagtga atggagactg atctcaaatc tgaactgcag ctttcgctgc tgtgagttgg 1320 

ggatatgata gtcagctcag gcttcagatt gtatgagaaa aatgaagaga agtcaacaaa 1380 

tattttggta ctcttcattc atttatctct aaaaccagga gttgaatttt cctcatcttg 14 40 

aaagactctt ggggtctgtt tctggtattt tacaaaattg ctaagtggaa tgcatgaatt 1500 

gcattatgtt ctctggtaac acgtagagtt cagacccttc tgaactctgt tgataatacc 1560 

acaccatgtt ctggacccat agctctggca tcctcagggg ttgtgatcca gctccatata 1620 

ttgtttacct tcaaagatac aattaaatgg cttgattttt aaaaaaaaaa aaaaaaaa 1678 

<210> 108 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_002811 
<400> 108 

aaattgctaa gtggaatgca tgaattgcat tatgttctct ggtaacacgt agagttcaga 60 

<210> 109 
<211> 846 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_002888 
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<400> 109 

ccacgtccgg ggtgccgagc caactttcct gcgtccatgc agccccgccg gcaacggctg 60 

cccgctccct ggtccgggcc caggggcccg cgccccaccg ccccgctgct cgcgctgctg 120 

ctgttgctcg ccccggtggc ggcgcccgcg gggtccgggg gccccgacga ccctgggcag 180 

cctcaggatg ctggggtccc gcgcaggctc ctgcagcaga aggcgcgcgc ggcgcttcac 240 

ttcttcaact tccggtccgg ctcgcccagc gcgctgcgag tgctggccga ggtgcaggag 300 

ggccgcgcgt ggattaatcc aaaagaggga tgtaaagttc acgtggtctt cagcacagag 360 

cgctacaacc cagagtcttt acttcaggaa ggtgagggac gtttggggaa atgttctgct 420 

cgagtgtttt tcaagaatca gaaacccaga ccaaccatca atgtaacttg tacacggctc 4 80 

atcgagaaaa agaaaagaca acaagaggat tacctgcttt acaagcaaat gaagcaactg 540 

aaaaacccct tggaaatagt cagcatacct gataatcatg gacatattga tccctctctg 600 

agactcatct gggatttggc tttccttgga agctcttacg tgatgtggga aatgacaaca 660 

caggtgtcac actactactt ggcacagctc actagtgtga ggcagtgggt aagaaaaacc 720 

tgaaaattaa cttgtgccac aagagttaca atcaaagtgg tctccttaga ctgaattcat 7 80 

gtgaacttct aatttcatat caagagttgt aatcacattt atttcaataa atatgtgagt 840 
tcctgc 846 

<210> 110 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002888 
<400> 110 

aaagaaaaga caacaagagg attacctgct ttacaagcaa atgaagcaac tgaaaaaccc 60 

<210> 111 
<211> 1054 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> NM_003090 



<400> 111 

gaattccgcg ggaggccacg ggctttccac 

tcaagctgac ggcggagctg atcgagcagg 

gggagctgga cctccggggg tataaaattc 

accagtttga tgctattgat ttttctgaca 

tgttgagaag actgaaaaca ttgttagtga 

gacttgatca ggctctgccc tgtctgacag 

aactgggtga tctggaccct ctggcatctc 

gaaatccggt aaccaataag aagcattaca 

tcagagtact ggatttccag aaagtgaaac 

tcaagggcaa acggggtgca cagcttgcaa 

atccaggtgc tggtttgcca actgacaaaa 

aagcaatcaa gaatgccata gcaaatgctt 

ggttgctgca gtctggtcag atccctggca 

gtgaagaaga gatggaagaa gacacagtca 

ataataatag gccctcttgg aacaagtctt 

tgtgttagca aagtggaatc tatcagcatt 

tttgtaatat aagttttgaa atctaaatgt 

cactctctat gctaaaaaaa aaaaaaagga 

<210> 112 
<211> 60 
<212> DNA 

<213> Homo sapiens 



agcgcggggg aacgggaggc tgcaggatgg 60 

cggcgcagta caccaacgcg gtgcgcgacc 120 

ccgtcattga aaatctaggt gctacgttag 180 

atgagatcag gaaactggat ggttttcctt 240 

acaacaacag aatatgccgt ataggtgagg 300 

aactcattct caccaataat agtctcgtgg 360 

tcaaatcgct gacttaccta agtatcctaa 420 

gattgtatgt gatttataaa gttccgcaag 4 80 

taaaagagcg tcaggaagca gagaaaatgt 540 

aggatattgc caggagaagc aaaactttta 600 

agagaggtgg gccatctcca ggggatgtag 660 

caactctggc tgaagtggag aggctgaagg 720 

gagaacgcag atcagggccc actgatgatg 780 

caaacgggtc ctgagcagtg aggcagatgt 840 

gcttttcgaa catggtataa tagccttgtt 900 

gttgaaatgc ttaagactgc tgctgataat 960 

caattttcta caaattataa aaataaactc 1020 
attc 1054 
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<300> 

<308> NM_003090 
<400> 112 

taatagcctt gtttgtgtta gcaaagtgga atctatcagc attgttgaaa tgcttaagac 60 

<210> 113 
<211> 2033 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_003158 



<400> 113 

gaattccggg actgagctct tgaagacttg 

tcactccatt gcccaggcca gagtgcggga 

gcgcggtgct catgcccgta atcccagcat 

taaagaaaac tgcatttcag gacctgttaa 

tgttctcgtg actcagcaat ttccttgtca 

tcagcgggtc ttgtgtcctt caaattcttc 

tgtctccagt cacaagccgg ttcagaatca 

acctcatcct gtctccaggc cactgaataa 

gcacctgaaa ataatcctga ggaggaactg 

agaggcagtg gctttggaag actttgaaat 

taatgtttat ttggcaagag aaaagcaaag 

taaagctcag ctggagaaag ccggagtgga 

gtcccacctt cggcatccta atattcttag 

agtctaccta attctggaat atgcaccact 

ttcaaagttt gatgagcaga gaactgctaa 

ttactgtcat tcgaagagag ttattcatag 

atcagctgga gagcttaaaa ttgcagattt 

gaggaccact ctctgtggca ccctggacta 

gcatgatgag aaggtggatc tctggagcct 

gaagcctcct tttgaggcaa acacatacca 

attcacattc cctgactttg taacagaggg 

gcataatccc agccagaggc caatgctcag 

aaattcatca aaaccatcaa attgccaaaa 

atcgtgcagg gggagaaatc cttgagccag 

actgaagttt attttaccat tgactgctgc 

tttgttttta ctcagcaggt gtgccttaac 

acatgacact ctgaagtgaa agtagccacg 

ctggaggcaa ggttcgactg cagtcgaacc 

gcacaattct cgtggctact ttcacttcag 

gaatagggag gttaaggcac acctgctgag 

aggaatctgg tgtctgtccg gccccggtag 

catctccata tgagagtgtg aaaataggaa 

ttgggataca gaagaggcca tgtgtctcag 

tggagtcata gcatgtgtgt aaactttaaa 

<210> 114 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003158 



ggtccttggt cgcaggtgga gcgacgggtc 60 

tatttgataa gaaacttcag tgaaggccgg 120 

tttcggaggc cgaggcatca tggaccgatc 180 

ggctacagct ccagttggag gtccaaaacg 240 

gaatccatta cctgtaaata gtggccaggc 300 

ccagcgcgtt cctttgcaag cacaaaagct 360 

gaagcagaag caattgcagg caaccagtgt 420 

cacccaaaag agcaagcagc ccctgccatc 480 

gcatcaaaac agaaaaatga agaatcaaaa 540 

tggtcgccct ctgggtaaag gaaagtttgg 600 

caagtttatt ctggctctta aagtgttatt 660 

gcatcagctc agaagagaag tagaaataca 720 

actgtatggt tatttccatg atgctaccag 780 

tggaacagtt tatagagaac ttcagaaact 84 0 

cttatataac agaattgcaa atgccctgtc 900 

agacattaag ccagagaact tacttcttgg 960 

tgggtggtca gtacatgctc catcttccag 1020 

cctgccccct gaaatgattg aaggtcggat 1080 

tggagttctt tgctatgaat ttttagttgg 1140 

agagacctac aaaagaatat cacgggttga 1200 

agccagggac ctcatttcaa gactgttgaa 1260 

agaagtactt gaacacccct ggatcacagc 1320 

caaagaatca gctagcaaac agtcttagga 1380 

ggctgccata taacctgaca ggaacatgct 1440 

cctcaatcta gaacgctaca caagaaatat 1500 

ctccctattc agaaagctcc acatcaataa 1560 

agaattgtgc tacttatact ggaacataat 1620 

ttgcctccag attatgaacc agtataagta 1680 

agtgtcatgt ttattgatgt ggagctttct 1740 

taaaacaaat atttcttgtg tagcgttctt 1800 

gcctgttggg tttctagtcc tccttaccat 18 60 

cacgtgctct acctccattt agggatttgc 1920 

agctgttaag ggcttatttt tttaaaacat 1980 
tatgcaggcc ttcgtggctc gag 2033 



<400> 114 

ttgggtttct agtcctcctt accatcatct ccatatgaga gtgtgaaaat aggaacacgt 60 
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<210> 115 
<211> 1421 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_003258 
<400> 115 

acttactgcg ggacggcctt ggagagtact cgggttcgtg aacttcccgg aggcgcaatg 60 

agctgcatta acctgcccac tgtgctgccc ggctccccca gcaagacccg ggggcagatc 120 

caggtgattc tcgggccgat gttctcagga aaaagcacag agttgatgag acgcgtccgt 180 

cgcttccaga ttgctcagta caagtgcctg gtgatcaagt atgccaaaga cactcgctac 240 

agcagcagct tctgcacaca tgaccggaac accatggagg cgctgcccgc ctgcctgctc 300 

cgagacgtgg cccaggaggc cctgggcgtg gctgtcatag gcatcgacga ggggcagttt 360 

ttccctgaca tcatggagtt ctgcgaggcc atggccaacg ccgggaagac cgtaattgtg 420 

gctgcactgg atgggacctt ccagaggaag ccatttgggg ccatcctgaa cctggtgccg 4 80 

ctggccgaga gcgtggtgaa gctgacggcg gtgtgcatgg agtgcttccg ggaagccgcc 540 

tataccaaga ggctcggcac agagaaggag gtcgaggtga ttgggggagc agacaagtac 600 

cactccgtgt gtcggctctg ctacttcaag aaggcctcag gccagcctgc cgggccggac 660 

aacaaagaga actgcccagt gccaggaaag ccaggggaag ccgtggctgc caggaagctc 720 

tttgccccac agcagattct gcaatgcagc cctgccaact gagggacctg caagggccgc 780 

ccgctccctt cctgccactg ccgcctactg gacgctgccc tgcatgctgc ccagccactc 840 

caggaggaag tcgggaggcg tggagggtga ccacaccttg gccttctggg aactctcctt 900 

tgtgtggctg ccccacctgc cgcatgctcc ctcctctcct acccactggt ctgcttaaag 960 

cttccctctc agctgctggg acgatcgccc aggctggagc tggccccgct tggtggcctg 1020 

ggatctggca cactccctct ccttggggtg agggacagag ccccacgctg ttgacatcag 1080 

cctgcttctt cccctctgcg gctttcactg ctgagtttct gttctccctg ggaagcctgt 1140 

gccagcacct ttgagccttg gcccacactg aggcttaggc ctctctgcct gggatgggct 1200 

cccaccctcc cctgaggatg gcctggattc acgccctctt gtttcctttt gggctcaaag 1260 

cccttcctac ctctggtgat ggtttccaca ggaacaacag catctttcac caagatgggt 1320 

ggcaccaacc ttgctgggac ttggatccca ggggcttatc tcttcaagtg tggagagggc 1380 
agggtccacg cctctgctgt agcttatgaa attaactaat t 1421 

<210> 116 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003258 
<400> 116 

cttcctacct ctggtgatgg tttccacagg aacaacagca tctttcacca agatgggtgg 60 

<210> 117 

<211> 913 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003311 
<400> 117 

agagccggcg ccgtcaccgc ccgcattgcc gctcccagtc ccgcgctcgg cacgacatga 60 

aatcccccga cgaggtgcta cgcgagggcg agttggagaa gcgcagcgac agcctcttcc 120 

agctatggaa gaagaagcgc ggggtgctca cctccgaccg cctgagcctg ttccccgcca 180 

gcccccgcgc gcgccccaag gagctgcgct tccactccat cctcaaggtg gactgcgtgg 240 

agcgcacggg caagtacgtg tacttcacca tcgtcaccac cgaccacaag gagatcgact 300 

tccgctgcgc gggcgagagc tgctggaacg cggccatcgc gctggcgctc atcgatttcc 360 

agaaccgccg cgccctgcag gactttcgca gccgccagga acgcaccgca cccgccgcac 420 
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ccgccgagga 


cgccgtggct 


gccgcggccg 


ccgcaccctc 


cgagccctcg 


gagccctcca 


480 


ggccatcccc 


gcagcccaaa 


ccccgcacgc 


catgagcccg 


ccgcgggcca 


tacgetggae 


540 




9a ? 9C ^ a "f 


cgtggccggc 


gctctccagc 


cctgcagcag 


aagaacttcc 


600 






ccgttgcacg 


ggcgccttaa 


gttattggac 


tatctaatat 




ctatgtattt 


atttcgctgg 


ttctttgtag 


tcacatattt 


tatagtctta 


atatcttgtt 


720 


tttgcatcac 


tgtgcccatt 


gcaaataaat 


cacttggcca 


gtttgctttt 


ctaccatccg 


780 


gctgtggctc 


agtgagactc 


ctgctgggag 


ggtggaggcc 


caggaatggg 


egggcaggae 


840 


accctcatcc 


agtcctgcgg 


ggctggtgtg 


aaaggcgctg 


ggaaccggct 


ttgaatgaat 


900 


aaatgaatcg 


tgt 913 












<210> 118 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_003311 












<400> 118 














atttcgctgg 


ttctttgtag 


tcacatattt 


tatagtctta 


atatcttgtt 


tttgcatcac 


60 


<210> 119 














<211> 1723 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_003376 












<400> 119 














tcgcggaggc 


ttggggcagc 




ggaggtcgtg 


qcgctggggg 


ctagcaccag 


60 


cgctctgtcg 


ggaggcgcag 




gaccggtcag 


cggactcacc 


ggccagggcg 


120 


ctcggtgctg 


gaatttgata 


ttcattgatc 


cgggttttat 


ccctcttctt 


fctttcttaaa 


180 


catttttttt 


taaaactgta 


ttgtttctcg 


ttttaattta 


tttttgcttg 


ccattcccca 


240 


cttgaatcgg 


gccgacggct 


tggggagatt 


gctctacttc 


cccaaatcac 


tgtggatttt 


300 


ggaaaccagc 


agaaagagga 


aagaggtagc 


aagagctcca 


gagagaagtc 


gaggaagaga 


360 


gagacggggt 


cagagagagc 


gcgcgggcgt 


gcgagcagcg 


aaagcgacag 


gggcaaagtg 


420 


agtgacctgc 


ttttgggggt 


gaccgccgga 


gcgcggcgtg 


agccctcccc 


cttgggatcc 


480 


cgcagctgac 


cagtcgcgct 


gacggacaga 


cagacagaca 


ccgcccccag 


ccccagctac 


540 


cacctcctcc 


ccggccggcg 


gcggacagtg 


gacgcggcgg 


cgagccgcgg 


geaggggecg 


600 


gagcccgcgc 


ccggaggcgg 


ggtggagggg 


gtcggggctc 


gcggcgtcgc 


actgaaactt 


660 


ttcgtccaac 


ttctgggctg 


ttctcgcttc 


ggaggagccg 


tggtccgcgc 


gggggaagee 


720 


gagccgagcg 


gagccgcgag 


aagtgctagc 


tcgggccggg 


aggagccgca 


geeggaggag 


780 


ggggaggagg 


aagaagagaa 




a gggggccgc 


agtggcgact 


cggcgctcgg 


840 


aagccgggct 


catggacggg 


tgaggcggcg 


gtgtgcgcag 


acagtgctcc 


agccgcgcgc 


900 


gctccccagg 


ccctggcccg 


ggcctcgggc 


c gggg a 99 a a 


gagtagctcg 


ccgaggcgcc 


960 


gaggagagcg 


ggccgcccca 






gcgcgagccg 


cgccggcccc 


1020 


ggtcgggcct 


ccgaaaccat 


gaac c g 


c g c ggg 


tgcattggag 


ccttgccttg 


1080 


ctgctctacc 


tccaccatgc 






ccatggcaga 


aggaggaggg 




cagaatcatc 


acgaagtggt 


gaagttcatg 


gatgtctatc 


agcgcagcta 


ctgccatcca 


1200 


atcgagaccc 


tggtggacat 


cttccaggag 


taccctgatg 


agatcgagta 


catcttcaag 


1260 


ccatcctgtg 


tgcccctgat 


gcgatgcggg 


ggctgctgca 


atgacgaggg 


cctggagtgt 


1320 


gtgcccactg 


aggagtccaa 


catcaccatg 


cagattatgc 


ggatcaaacc 


tcaccaaggc 


1380 


cagcacatag 


gagagatgag 


cttcctacag 


cacaacaaat 


gtgaatgcag 


accaaagaaa 


1440 


gatagagcaa 


gacaagaaaa 


aaaatcagtt 


cgaggaaagg 


gaaaggggca 


aaaacgaaag 


1500 


cgcaagaaat 


cccggtataa 


gtcctggagc 


gttccctgtg 


ggccttgctc 


agageggaga 


1560 


aagcatttgt 


ttgtacaaga 


tccgcagacg 


tgtaaatgtt 


cctgcaaaaa 


cacagactcg 


1620 


cgttgcaagg 


cgaggcagct 


tgagttaaac 


gaacgtactt 


gcagatgtga 


caagecgagg 


1680 


cggtgagccg 


ggcaggagga 


aggagcctcc 


ctcagggttt 


egg 1723 
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<210> 120 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_00337 6 
<400> 120 

ccagcacata ggagagatga gcttcctaca gcacaacaaa tgtgaatgca gaccaaagaa 60 

<210> 121 
<211> 2834 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_003406 
<400> 121 

gcccactccc accgccagct ggaaccctgg ggactacgac gtccctcaaa ccttgcttct 60 

aggagataaa aagaacatcc agtcatggat aaaaatgagc tggttcagaa ggccaaactg 120 

gccgagcagg ctgagcgata tgatgacatg gcagcctgca tgaagtctgt aactgagcaa 180 

ggagctgaat tatccaatga ggagaggaat cttctctcag ttgcttataa aaatgttgta 24 0 

ggagcccgta ggtcatcttg gagggtcgtc tcaagtattg aacaaaagac ggaaggtgct 300 

gagaaaaaac agcagatggc tcgagaatac agagagaaaa ttgagacgga gctaagagat 360 

atctgcaatg atgtactgtc tcttttggaa aagttcttga tccccaatgc ttcacaagca 420 

gagagcaaag tcttctattt gaaaatgaaa ggagattact accgttactt ggctgaggtt 480 

gccgctggtg atgacaagaa agggattgtc gatcagtcac aacaagcata ccaagaagct 540 

tttgaaatca gcaaaaagga aatgcaacca acacatccta tcagactggg tctggccctt 600 

aacttctctg tgttctatta tgagattctg aactccccag agaaagcctg ctctcttgca 660 

aagacagctt ttgatgaagc cattgctgaa cttgatacat taagtgaaga gtcatacaaa 720 

gacagcacgc taataatgca attactgaga gacaacttga cattgtggac atcggatacc 780 

caaggagacg aagctgaagc aggagaagga ggggaaaatt aaccggcctt ccaacttttg 840 

tctgcctcat tctaaaattt acacagtaga ccatttgtca tccatgctgt cccacaaata 900 

gttttttgtt tacgatttat gacaggttta tgttacttct atttgaattt ctatatttcc 960 

catgtggttt ttatgtttaa tattagggga gtagagccag ttaacattta gggagttatc 1020 

tgttttcatc ttgaggtggc caatatgggg atgtggaatt tttatacaag ttataagtgt 1080 

ttggcatagt acttttggta cattgtggct tcaaaagggc cagtgtaaaa ctgcttccat 1140 

gtctaagcaa agaaaactgc ctacatactg gtttgtcctg gcggggaata aaagggatca 1200 

ttggttccag tcacaggtgt agtaattgtg ggtactttaa ggtttggagc acttacaagg 1260 

ctgtggtaga atcatacccc atggatacca catattaaac catgtatatc tgtggaatac 1320 

tcaatgtgta cacctttgac tacagctgca gaagtgttcc tttagacaaa gttgtgaccc 1380 

attttactct ggataagggc agaaacggtt cacattccat tatttgtaaa gttacctgct 1440 

gttagctttc attatttttg ctacactcat tttatttgta tttaaatgtt ttaggcaacc 1500 

taagaacaaa tgtaaaagta aagatgcagg aaaaatgaat tgcttggtat tcattacttc 1560 

atgtatatca agcacagcag taaaacaaaa acccatgtat ttaacttttt tttaggattt 1620 

ttgcttttgt gatttttttt tttttttttt gatacttgcc taacatgcat gtgctgtaaa 1680 

aatagttaac agggaaataa cttgagatga tggctagctt tgtttaatgt cttatgaaat 1740 

tttcatgaac aatccaagca taattgttaa gaacacgtgt attaaattca tgtaagtgga 1800 

ataaaagttt tatgaatgga cttttcaact actttctcta cagcttttca tgtaaattag 1860 

tcttggttct gaaacttctc taaaggaaat tgtacatttt ttgaaattta ttccttattc 1920 

cctcttggca gctaatgggc tcttaccaag tttaaacaca aaatttatca taacaaaaat 1980 

actactaata taactactgt ttccatgtcc catgatcccc tctcttcctc cccaccctga 204 0 

aaaaaatgag ttcctatttt ttctgggaga gggggggatt gattagaaaa aaatgtagtg 2100 

tgttccattt aaaattttgg catatggcat tttctaactt aggaagccac aatgttcttg 2160 

gcccatcatg acattgggta gcattaactg taagttttgt gcttccaaat cactttttgg 2220 

tttttaagaa tttcttgata ctcttatagc ctgccttcaa ttttgatcct ttattctttc 2280 

tatttgtcag gtgcacaaga ttaccttcct gttttagcct tctgtcttgt caccaaccat 2340 

tcttacttgg tggccatgta cttggaaaaa ggccgcatga tctttctggc tccactcagt 2400 
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gtctaaggca ccctgcttcc tttgcttgca tcccacagac tatttccctc atcctattta 2460 

ctgcagcaaa tctctcctta gttgatgaga ctgtgtttat ctccctttaa aaccctacct 2520 

atcctgaatg gtctgtcatt gtctgccttt aaaatccttc ctctttcttc ctcctctatt 2580 

ctctaaataa tgatggggct aagttatacc caaagctcac tttacaaaat atttcctcag 2640 

tactttgcag aaaacaccaa acaaaaatgc cattttaaaa aaggtgtatt ttttctttta 2700 

gaatgtaagc tcctcaagag cagggacaat gttttctgta tgttctattg tgcctagtac 2760 

actgtaaatg ctcaataaat attgatgatg ggaggcagtg agtcttgatg ataagggtga 2820 
gaaactgaaa tccc 2834 

<210> 122 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003406 
<400> 122 

tttagccttc tgtcttgtca ccaaccattc ttacttggtg gccatgtact tggaaaaagg 60 

<210> 123 
<211> 1938 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_003504 
<400> 123 

gatttggcgg gagtcttgac cgccgccggg ctcttggtac ctcagcgcga gcgccaggcg 60 

tccggccgcc gtggctatgt tcgtgtccga tttccgcaaa gagttctacg aggtggtcca 120 

gagccagagg gtccttctct tcgtggcctc ggacgtggat gctctgtgtg cgtgcaagat 180 

ccttcaggcc ttgttccagt gtgaccacgt gcaatatacg ctggttccag tttctgggtg 240 

gcaagaactt gaaactgcat ttcttgagca taaagaacag tttcattatt ttattctcat 300 

aaactgtgga gctaatgtag acctattgga tattcttcaa cctgatgaag acactatatt 360 

ctttgtgtgt gacacccata ggccagtcaa tgtcgtcaat gtatacaacg atacccagat 420 

caaattactc attaaacaag atgatgacct tgaagttccc gcctatgaag acatcttcag 480 

ggatgaagag gaggatgaag agcattcagg aaatgacagt gatgggtcag agccttctga 540 

gaagcgcaca cggttagaag aggagatagt ggagcaaacc atgcggagga ggcagcggcg 600 

agagtgggag gcccggagaa gagacatcct ctttgactac gagcagtatg aatatcatgg 660 

gacatcgtca gccatggtga tgtttgagct ggcttggatg ctgtccaagg acctgaatga 720 

catgctgtgg tgggccatcg ttggactaac agaccagtgg gtgcaagaca agatcactca ' 7 80 

aatgaaatac gtgactgatg ttggtgtcct gcagcgccac gtttcccgcc acaaccaccg 84 0 

gaacgaggat gaggagaaca cactctccgt ggactgcaca cggatctcct ttgagtatga 900 

cctccgcctg gtgctctacc agcactggtc cctccatgac agcctgtgca acaccagcta 960 

taccgcagcc aggttcaagc tgtggtctgt gcatggacag aagcggctcc aggagttcct 1020 

tgcagacatg ggtcttcccc tgaagcaggt gaagcagaag ttccaggcca tggacatctc 1080 

cttgaaggag aatttgcggg aaatgattga agagtctgca aataaatttg ggatgaagga 1140 

catgcgcgtg cagactttca gcattcattt tgggttcaag cacaagtttc tggccagcga 1200 

cgtggtcttt gccaccatgt ctttgatgga gagccccgag aaggatggct cagggacaga 1260 

tcacttcatc caggctctgg acagcctctc caggagtaac ctggacaagc tgtaccatgg 1320 

cctggaactc gccaagaagc agctgcgagc cacccagcag accattgcca gctgcctttg 1380 

caccaacctc gtcatctccc aggggccttt cctgtactgc tctctcatgg agggcactcc 1440 

agatgtcatg ctgttctcta ggccggcatc cctaagcctg ctcagcaaac acctgctcaa 1500 

gtcctttgtg tgttcgacaa agaaccggcg ctgcaaactg ctgcccctgg tgatggctgc 1560 

ccccctgagc atggagcatg gcacagtgac cgtggtgggc atccccccag agaccgacag 1620 

ctcggacagg aagaactttt ttgggagggc gtttgagaag gcagcggaaa gcaccagctc 1680 

ccggatgctg cacaaccatt ttgacctctc agtaattgag ctgaaagctg aggatcggag 17 4 0 

caagtttctg gacgcactta tttccctcct gtcctaggaa tttgattctt ccagaatgac 1800 

cttcttattt atgtaactgg ctttcattta gattgtaagt tatggacatg atttgagatg 1860 

tagaagccat tttttattaa ataaaatgct tattttaggc tccgtcccca aaaaaaaaaa 1920 
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aaaaaaaaaa aaaaaaaa 1938 

<210> 124 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_003504 
<400> 124 

caagtttctg gacgcactta tttccctcct gtcctaggaa tttgattctt ccagaatgac 60 

<210> 125 
<211> 2346 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003600 
<400> 125 

acaaggcagc ctcgctcgag cgcaggccaa tcggctttct agctagaggg tttaactcct 60 

atttaaaaag aagaaccttt gaattctaac ggctgagctc ttggaagaot tgggtccttg 120 

ggtcgcaggt gggagccgac gggtgggtag accgtggggg atatctcagt ggcggacgag 180 

gacggcgggg acaaggggcg gctggtcgga gtggcggagc gtcaagtccc ctgtcggttc 240 

ctccgtccct gagtgtcctt ggcgctgcct tgtgcccgcc cagcgccttt gcatccgctc 300 

ctgggcaccg aggcgccctg taggatactg cttgttactt attacagcta gaggcatcat 360 

ggaccgatct aaagaaaact gcatttcagg acctgttaag gctacagctc cagttggagg 420 

tccaaaacgt gttctcgtga ctcagcaatt tccttgtcag aatccattac ctgtaaatag 480 

tggccaggct cagcgggtct tgtgtccttc aaattcttcc cagcgcattc ctttgcaagc 540 

acaaaagctt gtctccagtc acaagccggt tcagaatcag aagcagaagc aattgcaggc 600 

aaccagtgta cctcatcctg tctccaggcc actgaataac acccaaaaga gcaagcagcc 660 

cctgccatcg gcacctgaaa ataatcctga ggaggaactg gcatcaaaac agaaaaatga 720 

agaatcaaaa aagaggcagt gggctttgga agactttgaa attggtcgcc ctctgggtaa 780 

aggaaagttt ggtaatgttt atttggcaag agaaaagcaa agcaagttta ttctggctct 840 

taaagtgtta tttaaagctc agctggagaa agccggagtg gagcatcagc tcagaagaga 900 

agtagaaata cagtcccacc ttcggcatcc taatattctt agactgtatg gttatttcca 960 

tgatgctacc agagtctacc taattctgga atatgcacca cttggaacag tttatagaga 1020 

acttcagaaa ctttcaaagt ttgatgagca gagaactgct acttatataa cagaattggc 1080 

aaatgccctg tcttactgtc attcgaagag agttattcat agagacatta agccagagaa 1140 

cttacttctt ggatcagctg gagagcttaa aattgcagat tttgggtggt cagtacatgc 1200 

tccatcttcc aggaggacca ctctctgtgg caccctggac tacctgcccc ctgaaatgat 1260 

tgaaggtcgg atgcatgatg agaaggtgga tctctggagc cttggagttc tttgctatga 1320 

atttttagtt gggaagcctc cttttgaggc aaacacatac caagagacct acaaaagaat 1380 

atcacgggtt gaattcacat tccctgactt tgtaacagag ggagccaggg acctcatttc 1440 

aagactgttg aagcataatc ccagccagag gccaatgctc agagaagtac ttgaacaccc 1500 

ctggatcaca gcaaattcat caaaaccatc aaattgccaa aacaaagaat cagctagcaa 1560 

acagtcttag gaatcgtgca gggggagaaa tccttgagcc agggctgcca tataacctga 1620 

caggaacatg ctactgaagt ttattttacc attgactgct gccctcaatc tagaacgcta 1680 

cacaagaaat atttgtttta ctcagcaggt gtgccttaac ctccctattc agaaagctcc 1740 

acatcaataa acatgacact ctgaagtgaa agtagccacg agaattgtgc tacttatact 1800 

ggttcataat ctggaggcaa ggttcgactg cagccgcccc gtcagcctgt gctaggcatg 1860 

gtgtcttcac aggaggcaaa tccagagcct ggctgtgggg aaagtgacca ctctgccctg 1920 

accccgatca gttaaggagc tgtgcaataa ccttcctagt acctgagtga gtgtgtaact 1980 

tattgggttg gcgaagcctg gtaaagctgt tggaatgagt atgtgattct ttttaagtat 2040 

gaaaataaag atatatgtac agacttgtat tttttctctg gtggcattcc tttaggaatg 2100 

ctgtgtgtct gtccggcacc ccggtaggcc tgattgggtt tctagtcctc cttaaccact 2160 

tatctcccat atgagagtgt gaaaaatagg aacacgtgct ctacctccat ttagggattt 2220 

gcttgggata cagaagaggc catgtgtctc agagctgtta agggcttatt tttttaaaac 2280 
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attggagtca tagcatgtgt gtaaacttta aatatgcaaa taaataagta tctatgtcta 234 0 
aaaaaa 2346 

<210> 126 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003600 
<400> 126 

agagtgtgaa aaataggaac acgtgctcta cctccattta gggatttgct -tgggatacag 60 

<210> 127 
<211> 853 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_003641 
<400> 127 

ctagtcctga cttcacttct gatgaggaag cctctctcct tagccttcag cctttcctcc 60 
caccctgcca taagtaattt gatcctcaag aagttaaacc acacctcatt ggtccctggc 120 
taattcacca atttacaaac agcaggaaat agaaacttaa gagaaataca cacttctgag 180 
aaactgaaac gacaggggaa aggaggtctc actgagcacc gtcccagcat ccggacacca 240 
cagcggccct tcgctccacg cagaaaacca cacttctcaa accttcactc aacacttcct 300 
tccccaaagc cagaagatgc acaaggagga acatgaggtg gctgtgctgg gggcaccccc 360 
cagcaccatc cttccaaggt ccaccgtgat caacatccac agcgagacct ccgtgcccga 420 
ccatgtcgtc tggtccctgt tcaacaccct cttcttgaac tggtgctgtc tgggcttcat 4 80 
agcattcgcc tactccgtga agtctaggga caggaagatg gttggcgacg tgaccggggc 54 0 
ccaggcctat gcctccaccg ccaagtgcct gaacatctgg gccctgattc tgggcatcct 600 
catgaccatt ggattcatcc tgtcactggt attcggctct gtgacagtct accatattat 660 
gttacagata atacaggaaa aacggggtta ctagtagccg cccatagcct gcaacctttg 720 
cactccactg tgcaatgctg gccctgcacg ctggggctgt tgcccctgcc cccttggtcc 780 
tgcccctaga tacagcagtt tatacccaca cacctgtcta cagtgtcatt caataaagtg 840 
cacgtgcttg tga 853 

<210> 128 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_003641 
<400> 128 

attatgttac agataataca ggaaaaacgg ggttactagt agccgcccat agcctgcaac 60 

<210> 129 
<211> 1280 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_003756 
<400> 129 
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gaaagatggc gtcccgcaag gaaggtaccg 
ccggcgcagc agggaaaggc aaaggcaaag 
tgcagataga tggccttgtg gtattaaaga 
gaactgaagt tgttcaagga gtgcttttgg 
ccaactgctt tcctttccct cagcacacag 
atcagatgga aatgatgcgg agccttcgcc 
ggtatcagtc cacatactat ggctcattcg 
gttaccagca tgccattgaa gaatctgtcg 
aaggatctct ctcactaaag gcatacagac 
aaaaggattt ttcccctgaa gcattgaaaa 
aagaagtgcc gattgtaatt aaaaattcac 
aaaagaagtc agctgttgca gataaacatg 
tggggaagaa tctacagttg ctgatggaca 
aatacaacac atacatgagg aatactagta 
agcgtcgcca gcaggagaat atgcagcgcc 
aggacctgtc caaactcttc aaaccaccac 
ttgcaggcca gataaacact tactgccaga 
gcaagctctt catggcccag gctcttcaag 
gaaaagaagt taacatgaac tcttgaagtc 
ttgcatattg aaaagcacag aggatttctt 
tgtctttcta gccataataa aataaaaaaa 
aaaaaaaaaa aaaaaaaaaa 1280 



gctctactgc cacctcttcc agctccaccg 60 

gcggctcggg agattcagcc gtgaagcaag 120 

taatcaaaca ttatcaagaa gaaggacaag 180 

gtctggttgt agaagatcgg cttgaaatta 240 

aggatgatgc tgactttgat gaagtccaat 300 

atgtaaacat tgatcatctt cacgtgggct 360 

ttacccgggc actcctggac tctcagttta 420 

ttctcattta tgatcccata aaaactgccc 4 80 

tgactcctaa actgatggaa gtttgtaaag 540 

aagcaaatat cacctttgag tacatgtttg 600 

atctgatcaa tgtcctaatg tgggaacttg 660 

aattgctcag ccttgccagc agcaatcatt 720 

gagtggatga aatgagccaa gatatagtta 780 

aacaacagca gcagaaacat cagtatcagc 840 

agagccgagg agaacccccg ctccctgagg 900 

agccgcctgc caggatggac tcgctgctca 960 

acatcaagga gttcactgcc caaaacttag 1020 

aatacaacaa ctaagaaaag gaagtttcca 1080 

acaccagggc aactcttgga agaaatatat 1140 

tagtgtcatt gccgattttg gctataacag 1200 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1260 



<210> 130 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003756 



<400> 130 

tgagccaaga tatagttaaa tacaacacat acatgaggaa tactagtaaa caacagcagc 60 

<210> 131 
<2il> 839 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_003832 
<400> 131 

aagccacagg ctccctggct ggcgtcagct aaagtggctg ttgggtgtcc gcaggcttct 60 

gcctggccgc cgccgcctat aagctaccag gaggagcttt acgacttccc gtcctgcggg 120 

aagtggcggg cacgatcgca aggtagcgca gaagcttctc aatggccagc gccagctgca 180 

gccccggcgg cgcactcgcc tcacctgagc ctgggaggaa aattcttcca aggatgatct 240 

cccactcaga gctgaggaag cttttctact cagcagatgc tgtgtgtttt gatgttgaca 300 

gcacggtcat cagtgaagaa ggaatcggat gctttcattg gatttggagg aaatgtgatc 360 

aggcaacaag tcaaggataa cgccaaatgg tatatcactg attttgtaga gctgctggga 420 

gaaccggaag aataacatcc attgtcatac agctccaaac aacttcagat gaatttttac 480 

aagttacaca gattgatact gtttgcttac aattgcctat tacaacttgc tataaaaagt 540 

tggtacagat gatctgcact gtcaagtaaa ctacagttag gaatcctcaa agattggttt 600 

gtttgttttt aactgtagtt ccagtattat atgatcacta tcgatttcct ggagagtttt 660 

gtaatctgaa ttctttatgt atattcctag ctatatttca tacaaagtgt tttaagagtg 720 

gagagtcaat taaacacctt tactcttagg aatatagatt cggcagcctt cagtgaatat 780 

tggttttttt ccctttggta tgtcaataaa agtttatcca tgtgtcagaa aaaaaaaaa 839 

<210> 132 
<211> 60 
<212> DNA 
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<213> Homo sapiens 



<300> 

<308> NM_003832 
<400> 132 

gaagaaggaa tcggatgctt tcattggatt tggaggaaat gtgatcaggc aacaagtcaa 60 

<210> 133 
<211> 3128 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_003981 
<400> 133 

gcttcgcccc gtggcgcggt ttgaaatttt gcggggctca acggctcgcg gagcggctac 60 

gcggagtgac atcgccggtg tttgcgggtg gttgttgctc tcggggccgt gtggagtagg 120 

tctggacctg gactcacggc tgcttggagc gtccgccatg aggagaagtg aggtgctggc 180 

ggaggagtcc atagtatgtc tgcagaaagc cctaaatcac cttcgggaaa tatgggagct 240 

aattgggatt ccagaggacc agcggttaca aagaactgag gtggtaaaga agcatatcaa 300 

ggaactcctg gatatgatga ttgctgaaga ggaaagcctg aaggaaagac tcatcaaaag 360 

catatccgtc tgtcagaaag agctgaacac tctgtgcagc gagttacatg ttgagccatt 420 

tcaggaagaa ggagagacga ccatcttgca actagaaaaa gatttgcgca cccaagtgga 4 80 

attgatgcga aaacagaaaa aggagagaaa acaggaactg aagctacttc aagagcaaga 540 

tcaagaactg tgcgaaattc tttgtatgcc ccactatgat attgacagtg cctcagtgcc 600 

cagcttagaa gagctgaacc agttcaggca acatgtgaca actttgaggg aaacaaaggc 660 

ttctaggcgt gaggagtttg tcagtataaa gagacagatc atactgtgta tggaagaatt 720 

agaccacacc ccagacacaa gctttgaaag agatgtggtg tgtgaagacg aagatgcctt 780 

■ ttgtttgtct ttggagaata ttgcaacact acaaaagttg ctacggcagc tggaaatgca 840 

gaaatcacaa aatgaagcag tgtgtgaggg gctgcgtact caaatccgag agctotggga 900 

caggttgcaa atacctgaag aagaaagaga agctgtggcc accattatgt ctgggtcaaa 960 

ggccaaggtc cggaaagcgc tgcaattaga agtggatcgg ttggaagaac tgaaaatgca 1020 

aaacatgaag aaagtgattg aggcaattcg agtggagctg gttcagtact gggaccagtg 1080 

cttttatagc caggagcaga gacaagcttt tgcccctttc tgtgctgagg actacacaga 1140 

aagtctgctc cagctccacg atgctgagat tgtgcggtta aaaaactact atgaagttca 1200 

caaggaactc tttgaaggtg tccagaagtg ggaagaaacc tggaggcttt tcttagagtt 1260 

tgagagaaaa gcttcagatc caaatcgatt tacaaaccga ggaggaaatc ttctaaaaga 1320 

agaaaaacaa cgagccaagc tccagaaaat gctgcccaag ctggaagaag agttgaaggc 1380 

acgaattgaa ttgtgggaac aggaacattc aaaggcattt atggtgaatg ggcagaaatt 1440 

catggagtat gtggcagaac aatgggagat gcatcgattg gagaaagaga gagccaagca 1500 

ggaaagacaa ctgaagaaca aaaaacagac agagacagag atgctgtatg gcagcgctcc 1560 

tcgaacacct agcaagcggc gaggactggc tcccaataca ccgggcaaag cacgtaagct 1620 

gaacactacc accatgtcca atgctacggc caatagtagc attcggccta tctttggagg 1680 

gacagtctac cactcccccg tgtctcgact tcctccttct ggcagcaagc cagtcgctgc 1740 

ttccacctgt tcagggaaga aaacaccccg tactggcagg catggagcca acaaggagaa 1800 

cctggagctc aacggcagca tcctgagtgg tgggtaccct ggctcggccc ccctccagcg 1860 

caacttcagc attaattctg ttgccagcac ctattctgag tttgcgaagg atccgtccct 1920 

ctctgacagt tccactgttg ggcttcagcg agaactttca aaggcttcca aatctgatgc 1980 

tacttctgga atcctcaatt caaccaacat ccagtcctga gaagccctga tcagtcaacc 2040 

agctgtggct tcctgtgcct agactggacc taattatatg ggggtgactt tagtttttct 2100 

tcagcttagg cgtgcttgaa accttggcca ggttccatga ccatgggcct aacttaaaga 2160 

tgtgaatgag tgttacagtt gaaagcccat cataggttta gtggtcctag gagacttggt 2220 

tttgacttat atacatgaaa agtttatggc aagaagtgca aattttagca tatggggcct 2280 

gacttctcta ccacataatt ctacttgctg aagcatgatc aaagcttgtt ttatttcacc 2340 

actgtaggaa aatgattgac tatgcccatc cctgggggta attttggcat gtatacctgt 2400 

aactagtaat taacatcttt tttgtttagg catgttcaat taatgctgta gctatcatag 2460 

ctttgctctt acctgaagcc ttgtccccac cacacaggac agccttcctc ctgaagagaa 2520 
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tgtctttgtg tgtccgaagt 


tgagatggcc 


tgccctactg 


ccaaagaggt 


gacaggaagg 


2580 


ctgggagcag 


ctttgttaaa 


ttgtgttcag 


ttctgttaca 


cagtgcattg 


ccctttgttg 


2640 


ggggtatgca 


tgtatgaaca 


cacatgcttg 


tcggaacgct 


ttctcggcgt 


ttgtcccttg 


2700 


gctctcatct 


cccccattcc 


tgtgcctact 


ttgcctgagt 


tcttctaccc 


ccgcagttgc 


2760' 


cagccacatt 


gggagtctgt 


ttgttccaat 




gtctttgtcg 


tggagatctg 




gaactttgca 


catgtcacta 


ctggggaggt 






gatgaggcgc 


9P«n 


cctctttacc 


tatcctctca 




ttcttgaagc 


actattattt 


attcttccgc 


2940 


tgtctgcctg 


cagcagtact 


actgtcaaca 


tagtgtaaat 


ggttctcaaa 


agcttaccag 


3000 


tgtggacttg gtgttagcca 


cgctgtttac 


tcatacagta 


cgtgtcctgt 


ttttaaaata 


3060 


tacaattatt 


cttaaaaata 


aattaaaatc 


tgtatactta 


catttcaaaa 


agaaaaaaaa 


3120 


aaaaaaaa 3128 












<210> 134 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_003981 












<400> 134 














tgcagcagta ctactgtcaa 


catagtgtaa 


atggttctca 


aaagcttacc 


agtgtggact 


60 


<210> 135 














<211> 1816 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_004029 












<400> 135 














ggcacccagg 


gtccggcctg 


C g ccttccc g 


ccaggcctgg 


acac gg c 


aacacc g g 




acttcatgtg 


tgcgcgccgg 


ccacacctgc 


agtcacacct 




gc aagag 


i ?n 


atccataccg 


aggcagcgtc 






ccacacct 


ggacacc g 




gacacctggc 


cacacgacct 


gtggccgcgg 


cctggcgtct 


9 g 




240 


ctcccctgtt 


ataacacctg 


accgccacct 


aactgcccct 


g . :L 


t 




ggctcctgag 


agggcagccc 


cacgcgtgct 




ggc cc g 


gagagatcag 




cagcggctgc 


tatgaggggc 


tgcagtggct 


ggacgaggcc 




tccgcgtgcc 




ctggaagcac 


ttcgcgcgca 


aggacctgag 




gcgcgcatct 






ggctgtggcc 


cgcggcaggt 


ggccgcctag 


cagcagggga 


ggtggcccgc 


^ aggCC 9 


540 


tgagactgcg 


gagcgcgccg 


gctggaaaac 


caacttccgc 




gcagcacgcg 


600 


tcgcttcgtg 


atgctgcggg 


ataactcggg 


ggacccggcc 


gacccgcaca 


aggtgtacgc 


660 


gctcagccgg 


gagctgtgct 


ggcgagaagg 


cccaggcacg 


gaccagactg 


g ggc 


720 


ccccgcagct 


gtcccaccac 


ca cagggtgg 






cacacacaca 




tgctggactc 


caagccccag 


gccccctccc 


tgccccagct 


ggtgacaagg 


ggg c cc 


840 


gctccaggca 


gtgcaacaga 


gctgcctggc 


agaccatctg 


ctgacagcgt 


cacggggggc 


900 


agatccagtc 


ccaaccaagg 


ctcctggaga 


gggacaagaa 


gggcttcccc 


tgactggggc 


960 


ctgtgctgga 


ggcgaggccg 


cggccccaga 


gtccccgcac 


caggcagagc 


y g 


n?n 


accctcccca 


agcgcctgca 


ccgcggtgca 


agagcccagc 




ggacg gac 




catcatgtac 


aagggccgca 


cggtgctgca 








1140 


cctatacggc 


cccccagacc 


cagctgtccg 


ggccacagac 


ccccagcagg 


tagcattccc 


1200 


cagccctgcc 


gagctcccgg 


accagaagca 


gctgcgctac 


acggaggaac 


tgctgcggca 


1260 


cgtggcccct 


gggttgcacc 


tggagcttcg 


ggggccacag 


ctgtgggccc 


ggcgcatggg 


1320 


caagtgcaag 


gtgtactggg 


aggtgggcgg 


acccccaggc 


tccgccagcc 


cctccacccc 


. 1380 


agcctgcctg 


ctgcctcgga 


actgtgacac 


ccccatcttc 


gacttcagag 


tcttcttcca 


1440 


agagctggtg 


gaattccggg 


cacggcagcg 


ccgtggctcc 


ccacgctata 


ccatctacct 


1500 


gggcttcggg 


caggacctgt 


cagctgggag 


gcccaaggag 


aagagcctgg 


tcctggtgaa 


1560 


gctggaaccc 


tggctgtgcc 


gagtgcacct 


agagggcacg 


cagcgtgagg 


gtgtgtcttc 


1620 


cctggatagc 


agcagcctca 


gcctctgcct 


gtccagcgcc 


aacagcctct 


atgacgacat 


1680 


cgagtgcttc 


cttatggagc 


tggagcagcc 


cgcctagaac 


ccagtctaat 


gagaactcca 


1740 
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gaaagctgga gcagcccacc tagagctggc cgcggccgcc cagtctaata aaaagaactc 1800 
cagaacaaaa aaaaaa 1816 

<210> 136 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004029 
<400> 136 

agoagcccac ctagagctgg ccgcggccgc ccagtctaat aaaaagaact ccagaacaaa 60 

<210> 137 
<211> 2121 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004203 
<400> 137 

tggaattttt ggcgcgagca gctccgcgcg cgttcacggg ccgttccccc tcacgggagt 60 

cctccgcccg ggcgtccgga acagtcgacg gcagactccg gcccgctgag ccacccgagg 120 

ggtcccgtgg cctccgcgga cccggaatct gggccctcgc ggacccgcgc cccgcccagt 180 

cgccccaggg cttccccaca cccacggagt gaagtcagcc gcggccctgc ctgggaggaa 240 

cttaccgtct accgggaaag gtggccagca gatgtgtcgg gcctggtgag agggtgaggc 300 

gagacggccc gatcgcccag ggccccggaa gctgcggagg tcacccccgc ctggccttag 360 

ctcagggaca ccctggattc acgtgggagc ccctgctcct gcctcccccg tcccaccact 420 

gaggctgttg ggccaggcca gtcatgctag aacggcctcc tgcactggcc atgcccatgc 480 

ccacggaggg caccccgcca cctctgagtg gcacccccat cccagtccca gcctacttcc 540 

gccacgcaga acctggattc tccctcaaga ggcccagggg gctcagccgg agcctcccac 600 

ctccgccccc tgccaagggc agcattccca tcagccgcct cttccctcct cggaccccag 660 

gctggcacca gctgcagccc cggcgggtgt cattccgggg cgaggcctca gagactctgc 720 

agagccctgg gtatgaccca agccggccag agtccttctt ccagcagagc ttccagaggc 780 

tcagccgcct gggccatggc tcctacggag aggtcttcaa ggtgcgctcc aaggaggacg 840 

gccggctcta tgcggtaaag cgttccatgt caccattccg gggccccaag gaccgggccc 900 

gcaagttggc cgaggtgggc agccacgaga aggtggggca gcacccatgc tgcgtgcggc 960 

tggagcaggc ctgggaggag ggcggcatcc tgtacctgca gacggagctg tgcgggccca 1020 

gcctgcagca acactgtgag gcctggggtg ccagcctgcc tgaggcccag gtctggggct 1080 

acctgcggga cacgctgctt gccctggccc atctgcacag ccagggcctg gtgcaccttg 1140 

atgtcaagcc tgccaacatc ttcctggggc cccggggccg ctgcaagctg ggtgacttcg 1200 

gactgctggt ggagctgggt acagcaggag ctggtgaggt ccaggaggga gacccccgct 1260 

acatggcccc cgagctgctg cagggctcct atgggacagc agcggatgtg ttcagtctgg 1320 

gcctcaccat cctggaagtg gcatgcaaca tggagctgcc ccacggtggg gagggctggc 1380 

agcagctgcg ccagggctac ctgccccctg agttcactgc cggtctgtct tccgagctgc 1440 

gttctgtcct tgtcatgatg ctggagccag accccaagct gcgggccacg gccgaggccc 1500 

tgctggcact gcctgtgttg aggcagccgc gggcctgggg tgtgctgtgg tgcatggcag 1560 

cggaggccct gagccgaggg tgggccctgt ggcaggccct gcttgccctg ctctgctggc 1620 

tctggcatgg gctggctcac cctgccagct ggctacagcc cctgggcccg ccagccaccc 1680 

cgcctggctc accaccctgc agtttgctcc tggacagcag cctctccagc aactgggatg 1740 

acgacagcct agggccttca ctctcccctg aggctgtcct ggcccggact gtggggagca 1800 

cctccacccc ccggagcagg tgcacaccca gggatgccct ggacctaagt gacatcaact 1860 

cagagcctcc tcggggctcc ttcccctcct ttgagcctcg gaacctcctc agcctgtttg 1920 

aggacaccct agacccaacc tgagccccag actctgcctc tgcactttta accttttatc 1980 

ctgtgtctct cccgtcgccc ttgaaagctg gggcccctcg ggaactccca tggtcttctc 2040 

tgcctggccg tgtctaataa aaagtatttg aaccttggga gcacccaagc ttgctcatgt 2100 
ggcaaaaaaa aaaaaaaaaa a 2121 

<210> 138 
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<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004203 
<400> 138 

ctggccgtgt ctaataaaaa gtatttgaac cttgggagca cccaagcttg ctcatgtggc 60 

<210> 139 
<211> 1982 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_004207 
<400> 139 

ggcgagaggc gggctgaggc ggcccagcgg cggcaggtga ggcggaacca accctcctgg 60 

ccatgggagg ggccgtggtg gacgagggcc ccacaggcgt caaggcccct gacggcggct 120 

ggggctgggc cgtgctcttc ggctgtttcg tcatcactgg cttctcctac gccttcccca 180 

aggccgtcag tgtcttcttc aaggagctca tacaggagtt tgggatcggc tacagcgaca 240 

cagcctggat ctcctccatc ctgctggcca tgctctacgg gacaggtccg ctctgcagtg 300 

tgtgcgtgaa ccgctttggc tgccggcccg tcatgcttgt ggggggtctc tttgcgtcgc 360 

tgggcatggt ggctgcgtcc ttttgccgga gcatcatcca ggtctacctc accactgggg 420 

tcatcacggg gttgggtttg gcactcaact tccagccctc gctcatcatg ctgaaccgct 4 80 

acttcagcaa gcggcgcccc atggccaacg ggctggcggc agcaggtagc cctgtcttcc 540 

tgtgtgccct gagcccgctg gggcagctgc tgcaggaccg ctacggctgg cggggcggct 600 

tcctcatcct gggcggcctg ctgctcaact gctgcgtgtg tgccgcactc atgaggcccc 660 

tggtggtcac ggcccagccg ggctcggggc cgccgcgacc ctcccggcgc ctgctagacc 720 

.tgagcgtctt ccgggaccgc ggctttgtgc tttacgccgt ggccgcctcg gtcatggtgc 780 

tggggctctt cgtcccgccc gtgttcgtgg tgagctacgc caaggacctg ggcgtgcccg 840 

acaccaaggc cgccttcctg ctcaccatcc tgggcttcat tgacatcttc gcgcggccgg 900 

ccgcgggctt cgtggcgggg cttgggaagg tgcggcccta ctccgtctac ctcttcagct 960 

tctccatgtt cttcaacggc ctcgcggacc tggcgggctc tacggcgggc gactacggcg 1020 

gcctcgtggt cttctgcatc ttctttggca tctcctacgg catggtgggg gccctgcagt 1080 

tcgaggtgct catggccatc gtgggcaccc acaagttctc cagtgccatt ggcctggtgc 1140 

tgctgatgga ggcggtggcc gtgctcgtcg ggcccccttc gggaggcaaa ctcctggatg 1200 

cgacccacgt ctacatgtac gtgttcatcc tggcgggggc cgaggtgctc acctcctccc 1260 

tgattttgct gctgggcaac ttcttctgca ttaggaagaa gcccaaagag ccacagcctg 1320 

aggtggcggc cgcggaggag gagaagctcc acaagcctcc tgcagactcg ggggtggact 1380 

tgcgggaggt ggagcatttc ctgaaggctg agcctgagaa aaacggggag gtggttcaca 1440 

ccccggaaac aagtgtctga gtggctgggc ggggccggca ggcacaggga ggaggtacag 1500 

aagccggcaa cgcttgctat ttattttaca aactggactg gctcaggcag ggccacggct 1560 

gggctccagc tgccggccca gcggatcgtc gcccgatcag tgttttgagg gggaaggtgg 1620 

cggggtggga accgtgtcat tccagagtgg atctgcggtg aagccaagcc gcaaggttac 1680 

aaggcatcct caccaggggc cccgcctgct gctcccaggt ggcctgcggc cactgctatg 1740 

ctcaaggacc tggaaaccca tgcttcgaga caacgtgact ttaatgggag ggtgggtggg 1800 

ccgcagacag gctggcaggg caggtgctgc gtggggccct ctccagcccg tcctaccctg 1860 

ggctcacatg gggcctgtgc ccacccctct tgagtgtctt ggggacagct ctttccaccc 1920 

ctggaagatg gaaataaacc tgcgtgtggg tggagtgttc tcgtgccgaa ttcaaaaagc 1980 
tt 1982 

<210> 140 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004207 
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<400> 140 

cctcttgagt gtcttgggga cagctctttc cacccctgga agatggaaat aaacctgcgt 60 

<210> 141 
<211> 2054 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_004209 
<400> 141 

cgggaggcgg cagcggctgc agcgttggta gcatcagcat cagcatcagc ggcagcggca 60 

gcggcctcgg gcggggccgg ccggacggac aggcggacag aaggcgccag gggcgcgcgt 120 

cccgcccggg ccggccatgg agggcgcctc cttcggcgcg ggccgcgcag gggccgccct 180 

ggaccccgtg agctttgcgc ggcggcccca gaccctgctc cgggtcgcgt cctgggtgtt 240 

ctccatcgcc gtcttcgggc ccatcgtcaa cgagggctac gtgaacaccg acagcggccc 300 

cgagctgcgc tgcgtgttca acgggaacgc gggcgcctgc cgcttcggcg tcgcgctggg 360 

cctcggagcc ttcctcgcct gcgccgcctt cctgctgctc gatgtgcgct tccagcaaat 420 

cagcagcgtc cgcgaccgcc ggcgcgcggt gttgctggac ctgggcttct caggactctg 480 

gtccttcctg tggttcgtgg gcttctgctt cctcaccaat cagtggcagc gcacggcgcc 54 0 

agggccggcc acgacgcagg cgggggacgc ggcgcgggcc gccatcgcct tcagcttctt 600 • 

ctccatcctc agctgggtgg cgctcaccgt gaaggccctg cagcggttcc gcctgggcac 660 

cgacatgtca ctcttcgcca ccgaacagct gagcaccggg gcgagccagg cctaccccgg 720 

ctatccggtg ggcagcggcg tggagggcac cgagacctac cagagcccgc ccttcaccga 780 

gaccctggac accagcccca aagggtacca ggtgcccgcc tactagcggc tggcaggcac 840 

agaccagggc tccaaggcca ccccaccaac gcaggcccca gggtctccgg gacctccctt 900'' 

gggtccttcc agctcagtgc cgcggacaga gtaggtggcc gctttgcgcc atccggggcc 960 

aagagggggt ggacccgcgt gtctgggctg cccctgccaa gttcccccag tccctcagca 1020 

cctggcccca ggactgaggt cctgagaagg ggatagcact gcccaggacg tgtgtcccta 1080 

gcctggaatg gactggcctg gggaaggctt tcccctcttg ggccacacct gctcactctg 1140 

gggttggggg tccagctgcc ctctacgatc aggtgcaggg gctgcccagg acaaagcggg 1200 

ggcaggggaa agacaccacc ctcgccccaa gactggggat cctggccact gttcccatcc 1260 

catgtccctg tgggtagtga ctgtctcgtt tctgtcatgg tggtgcgtcc cgtccggagc 1320 

cactctccac tttctctcac aggctgctag aacagcccag ccctgtcagt gttgtgatca 1380 

tggtccagtc ttcgggtttc acctcctagt actccacaag ctgctcctct ctctgtggcc 1440 

ccggcccctg cccaggtgtg ggtggttctg gccaggaagg cacaaggtag ctgtgggcca 1500 

agacaccagc cctgtcctag cccttcagta agaccttgcc aggagaggag aaggatgcct 1560 

gggtgccagg caagacaagc ccctcagcag gagagaggcc cagaggctcc agctggccac 1620 

cgtgccccac aagatggccc ctgtgtggtt ccctttacct tggcttcctg gcccagtccc 1680 

tgcctctcca cctgcaccct gcttcctggc ccagtcccag gttggagtcc ctctgcatag 1740 

ctgactactc atgcattgct caaagctggc ttttcacatt aagtcaacac caaacgtggt 1800 

tgccacattt catcagacag acacctccct ctggagatgc agttgagtga caaccttgtt 1860 

acattgtagc ctagaccaat tctgtgtgga tatttaagtg aacatgttta caatttttgt 1920 

atatatcact ctctccctct cctgaaagac cagagattgt gtattttcag tgtcccatgt 1980 

tccgactgca ccttctttac aataaagact gtaactgagc tgactgtgaa aaaaaaaaaa 2040 
aaaaaaaaaa aaaa 2054 

<210> 142 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004209 
<400> 142 

gatgcagttg agtgacaacc ttgttacatt gtagcctaga ccaattctgt gtggatattt 60 
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<210> 143 
<211> 1224 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_004217 
<400> 143 

ggccgggaga gtagcagtgc cttggacccc agctctcctc cccctttctc tctaaggatg 60 

gcccagaagg agaactccta cccctggccc tacggccgac agacggctcc atctggcctg 120 

agcaccctgc cccagcgagt cctccggaaa gagcctgtca ccccatctgc acttgtcctc 180 

atgagccgct ccaatgtcca gcccacagct gcccctggcc agaaggtgat ggagaatagc 24 0 

agtgggacac ccgacatctt aacgcggcac ttcacaattg atgactttga gattgggcgt 300 

cctctgggca aaggcaagtt tggaaacgtg tacttggctc gggagaagaa aagccatttc 360 

atcgtggcgc tcaaggtcct cttcaagtcc cagatagaga aggagggcgt ggagcatcag 420 

ctgcgcagag agatcgaaat ccaggcccac ctgcaccatc ccaacatcct gcgtctctac 4 80 

aactattttt atgaccggag gaggatctac ttgattctag agtatgcccc ccgcggggag 540 

ctctacaagg agctgcagaa gagctgcaca tttgacgagc agcgaacagc cacgatcatg 600 

gaggagttgg cagatgctct aatgtactgc catgggaaga aggtgattca cagagacata 660 

aagccagaaa atctgctctt agggctcaag ggagagctga agattgctga cttcggctgg 720 

tctgtgcatg cgccctccct gaggaggaag acaatgtgtg gcaccctgga ctacctgccc 780 

ccagagatga ttgaggggcg catgcacaat gagaaggtgg atctgtggtg cattggagtg 840 

ctttgctatg agctgctggt ggggaaccca ccctttgaga gtgcatcaca caacgagacc 900 

tatcgccgca tcgtcaaggt ggacctaaag ttccccgctt ctgtgcccac gggagcccag 960 

gacctcatct ccaaactgct caggcataac ccctcggaac ggctgcccct ggcccaggtc 1020 

tcagcccacc cttgggtccg ggccaactct cggagggtgc tgcctccctc tgcccttcaa 1080 

tctgtcgcct gatggtccct gtcattcact cgggtgcgtg tgtttgtatg tctgtgtatg 1140 

tataggggaa agaagggatc cctaactgtt cccttatctg ttttctacct cctcctttgt 1200 
ttaataaagg ctgaagcttt ttgt 1224 

<210> 144 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004217 
<400> 144 

gtctgtgtat gtatagggga aagaagggat ccctaactgt tcccttatct gttttctacc 60 

<210> 145 
<211> 983 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_004335 
<400> 145 

gtggaattca tggcatctac ttcgtatgac tattgcagag tgcccatgga agacggggat 60 

aagcgctgta agcttctgct ggggatagga attctggtgc tcctgatcat cgtgattctg 120 

ggggtgccct tgattatctt caccatcaag gccaacagcg aggcctgccg ggacggcctt 180 

cgggcagtga tggagtgtcg caatgtcacc catctcctgc aacaagagct gacc.gaggcc 240 

cagaagggct ttcaggatgt ggaggcccag gccgccacct gcaaccacac tgtgatggcc 300 

ctaatggctt ccctggatgc agagaaggcc caaggacaaa agaaagtgga ggagcttgag 360 

ggagagatca ctacattaaa ccataagctt caggacgcgt ctgcagaggt ggagcgactg 420 

agaagagaaa accaggtctt aagcgtgaga atcgcggaca agaagtacta ccccagctcc 480 

caggactcca gctccgctgc ggcgccccag ctgctgattg tgctgctggg cctcagcgct 540 

ctgctgcagt gagatcccag gaagctggca catcttggaa ggtccgtcct gctcggcttt 600 
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tcgcttgaac 




^ G ^= a = Ca9t 


tC tct 9C " g 


tcatggggca 


aca egg t tag 




cggggagagc 


acggggtagc 


cggagaaggg 






gggccatggg 


720 


gcagtcctgg 


gtgtggggac 


acagtcgggt 


tgacccaggg 


ctgtctccct 




780 


cctccggaca 


atgagtcccc 


cctcttgtct 


cccaccctga 


gattgggcat 


ggggtgcggt 


840 


gtggggggca 


tgtgctgcct 


gttgttatgg 


gttttttttg 


cggggggggt 


tgetttttte 


900 








fctccfct fcgag 


ggagagcaaa 


aaaaaaaaaa 


960 


aaaaaaaaaa 


aaaaaaaaaa 


aaa 983 










<210> 146 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_004335 












<400> 146 














ggttgctttt ttctggggtc 


tttgagctcc 


aaaaaataaa 


cacttccttt 


gagggagagc 


60 


<210> 147 














<211> 3446 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_004336 












<400> 147 














ttctagtttg 


cggttcaggt 


.. . 


99 gcg 


cc c ggcca 


ggacacccc 




ggaaaatgtc 


cttcagatgc 


ttgaagccca 




acaagggca 


a gaccctct 


1 90 


tggtgaatgg 


gaaagataca 


tacagtgggt 


agaagagaat 




a aaagaa a 


i Rn 


cttgataact 


ttactagaac 


atttaatgaa 


ggaattttta 


ga aagaaga 


aa accacaa 




tgacccaaga 


ttcatcagtt 


attgtttaaa 


atttgctgag 


c acag g 


acc cca ca 




attttttgag 


tttctgtaca 


accatgggat 


yy g 


ca cccc c 


tgtacattgc 




ctgggcgggg 


catctggaag 


99 9 




gccag gc g 


tccttcagag 




aggaattcaa 


aaccaggctg 


aacccagaga 


g cc gcaa 


caacaa aca 


ggttatttca 




gacacgcctc 


actgaaaccc 






gaac 


etctgeataa 




tgttcaggtt 


ttaaatcaaa 


ga aaca c 


t t 
aaaa caaat 


ccaggaaa ta 


acatggcctg 


600 


catttctaag 


aatcagggtt 


cagagc c 




tcttcagctt 


gtgataaaga 


660 


gtcaaatatg 


gaacgaagag 


ga cacga 


ttctaaatca 


gaatattctg 


tgcactcatc 




tttggcatcc 


aaagttgatg 


9 9 99 


tgttatgtat 


gcaaggaga 


agcttattcg 


780 


tggggaatca 


gaattttcct 


9 9a 


gagagcccag 


aaa acaa c 


aacggagaaa 


840 


gcatgagcaa 


tgggtaaatg 


Taaaat 3 " 


a a gaaa 


aggaaagaag 


caaat get tt 




tgaagaacag 


ctattaaaac 






a gaag gc 


atcaggtggt 


Qfin 


ggagacatcc 


catgaggatc 


tgcccgcttc 


ccaggaaagg 




atccagcacg 


in n 


tatggggcca 


agtgtaggct 


cccagcagga 


actgagagcg 


ccat^tcttc 




1080 


tcagcagaca 


ccagtgaaca 


tggaaaagaa 


cccaagagag 


gcacctcctg 






tttggcaaat 


gctatttctg 


cagctttggt 


gtccccagcc 


accagccaga 


acatt° C t CC 
y 9 


1200 


tcctgttcct 


ttgaaagccc 


agacagtaac 


agactccatg 








tgctggatgt 


gtgaataaga 


gtactcatga 


attcaagcca 


caaaataaa 5 


caq^at 393 


1320 


agaagggtgt 


gaaacacata 


aggttgccaa 


cacaagttct 


tttcacacaa 


ctcca 




atcactggga 


atggttcagg 


caacgccatc 


caaagtgcag 


ccatcaccca 


ccgtgcacac 


1440 


aaaagaagca 


ttaggtttca 


tcatgaatat 


gtttcaggct 


cctacacttc 


ctgatatttc 


1500 


tgatgacaaa 


gatgaatggc 


aatctctaga 


tcaaaatgaa 


gatgcatttg 


aagcccagtt 


1560 


tcaaaaaaat 


gtaaggtcat 


ctggggcttg 


gggagtcaat 


aagatcatct 


cttctttgtc 


1620 


atctgctttt 


catgtgtttg 


aagatggaaa 


caaagaaaat 


tatggattac 


cacagcctaa 


1680 


aaataaaccc 


acaggagcca 


ggacctttgg 


agaacgctct 


gtcagcagac 


ttccttcaaa 


1740 


accaaaggag 


gaagtgcctc 


atgctgaaga 


gtttttggat 


gactcaactg 


tatggggtat 


1800 


tcgctgcaac 


aaaaccctgg 


cacccagtcc 


taagagccca 


ggagacttca 


catctgctgc 


1860 


acaacttgcg 


tctacaccat 


tccacaagct 


tccagtggag 


tcagtgcaca 


ttttagaaga 


1920 


taaagaaaat 


gtggtagcaa 


aacagtgtac 


ccaggcgact 


ttggattctt 


gtgaggaaaa 


1980 
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catggtggtg ccttcaaggg atggaaaatt 
ggccttgtcg tctcacatgt attcagcatc 
tggggtactt acctgtgagg cagagttggg 
cgctgccatt gcagaagatc caccagatgc 
gatgagttca cttgggactg ttgatgctcc 
taagctgatt ttcaaacttt tatctgggct 
ttttgaatgg caatgtaaac ttccagccat 
taagctggtc tatgtccatc accttcttgg 
tacccaggga gatctgaatg atgctaaaaa 
gcctgccaac ccctgggaat tctacattgg 
tatgcagcac atgtttatga agttctattc 
agtaggagag ctctacagct atggaacatt 
ccctgaaaaa gtgatgcctc aaggtcttgt 
gattgagcaa gtgcatgact gtgaaatcat 
acttggaaac ggatttttgg aacaggatga 
gattgacctg ggtcagagta tagatatgaa 
aaagtgtgaa acatctggtt ttcagtgtgt 
ccagatcgat tactttgggg ttgctgcaac 
gaaagtgaaa aatgaaggag gagagtgtaa 
tttggatatg tggaatgaat tttttcatgt 
tccatctttg gatttgttaa ggcaaaagct 
caagattagg gccctacgta ataggctaat 
aaaataaaat ttggatatag acagtcctta 
ctttaaacct gttttttttt catttattgt 
atggaatatt tccatgtaaa aaaaaa 34' 



cagtccaatt caagagaaaa gcccaaaaca 2040 

cttacttcgt ctgagccagc ctgctgcagg 2100 

cgttgaggct tgcagactca cagacactga 2160 

tattgctggg ctccaagcag aatggatgca 2220 

aaacttcatt gttgggaacc catgggatga 2280 

ttctaaacca gtgagttcct atccaaatac 234 0 

caagcccaag actgaatttc aattgggttc 2400 

agaaggagcc tttgcccagg tgtacgaagc 2460 

taaacagaaa tttgttttaa aggtccaaaa 2520 

gacccagttg atggaaagac taaagccatc 2580 

tgcccactta ttccagaatg gcagtgtatt 264 0 

attaaatgcc attaacctct ataaaaatac 2700 

catctctttt gctatgagaa tgctttacat 2760 

tcatggagac attaaaccag acaatttcat 2820 

tgaagatgat ttatctgctg gcttggcact 2880 

actttttcca aaaggaacta tattcacagc 2940 

tgagatgctc agcaacaaac catggaacta 3000 

agtatattgc atgctctttg gcacttacat • 3060 

gcctgaaggt ctttttagaa ggcttcctca 3120 

tatgttgaat attccagatt gtcatcatct 3180 

gaagaaagta tttcaacaac actatactaa 3240 

tgtactgctc ttagaatgta agcgttcacg 3300 

aaaatcacac tgtaaatatg aatctgctca 3360 

ttatgtaaat gtttgttaaa aataaatccc 3420 



<210> 148 
<211> 60 
<212> DNA 



<213> Homo sapiens 
<300> 

<308> NM_004 336 
<400> 148 

ttagggccct acgtaatagg ctaattgtac tgctcttaga atgtaagcgt tcacgaaaat 60 

<210> 149 
<211> 739 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004345 
<400> 149 

taaagcaaac cccagcccac accctggcag gcagccaggg atgggtggat 
cctggttggg cttttgcatc aggctcaggc tgggcataaa ggaggctcct 
gggaggcaga catggggacc atgaagaccc aaagggatgg ccactccctg 
cactggtgct cctgctgctg ggcctggtga tgcctctggc catcattgcc 
gctacaagga agctgtgctt cgtgctatag atggcatcaa ccagcggtcc 
acctctaccg cctcctggac ctggacccca ggcccacgat ggatggggac 
caaagcctgt gagcttcaca gtgaaggaga cagtgtgccc caggacgaca 
cagaggattg tgacttcaag aaggacgggc tggtgaagcg gtgtatgggg 
tcaaccaggc caggggctcc tttgacatca gttgtgataa ggataacaag 
tgctgggtga tttcttccgg aaatctaaag agaagattgg caaagagttt 
tccagagaat caaggatttt ttgcggaatc ttgtacccag gacagagtcc 
cctaccctgg ctcaggcttc tgggctctga gaaataaact atgagagcaa 
aaaaaaaaaa aaaaaaaaa 739 



caggaaggct 60 

gtgggctaga 120 

gggcggtggt 180 

caggtcctca 240 

tcggatgcta 300 

ccagacacgc 360 

cagcagtcac 420 

acagtgaccc 480 

agatttgccc 540 

aaaagaattg 600 

tagtgtgtgc 660 

tttcaaaaaa 720 
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<210> 150 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004345 
<400> 150 

gcaaagagtt taaaagaatt gtccagagaa tcaaggattt tttgcggaat cttgtaccca 60 

<210> 151 
<211> 1432 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_004577 
<400> 151 

gaggaaaatt cttccagcga tggtctccca ctcagagctg aggaagcttt tctactcagc 60 

agatgctgtg tgttttgatg ttgacagcac ggtcatcaga gaagaaggaa tcgatgagct 120 ■ 

agccaaaatc tgtggcgttg aggacgcggt gtcagaaatg acacggcgag ccatgggcgg 180 

ggcagtgcct ttcaaagctg ctctcacaga gcgcttagcc ctcatccagc cctccaggga 240 

gcaggtgcag agactcatag cagagcaacc cccacacctg acccccggca taagggagct 300 

ggtaagtcgc ctacaggagc gaaatgttca ggttttccta atatctggtg gctttaggag 360 

tattgtagag catgttgctt caaagctcaa tatcccagca accaatgtat ttgccaatag 420 

gctgaaattc tactttaacg gtgaatatgc aggttttgat gagacgcagc caacagctga 480 

atctggtgga aaaggaaaag tgattaaact tttaaaggaa aaatttcatt ttaagaaaat 540 

aatcatgatt ggagatggtg ccacagatat ggaagcctgt cctcctgctg atgctttcat 600 

tggatttgga ggaaatgtga tcaggcaaca agtcaaggat aacgccaaat ggtatatcac 660 

tgattttgta gagctgctgg gagaactgga agaataacat ccattgtcgt acagctccaa 720 

acaacttcag atgaattttt acaagttata cagattgata ctgtttgctt acagttgcct 780 

attacaactt gctatagaaa gttggtacaa atgatctgta ctttaaacta cagttaggaa 840 

tcctagaaga ttgctttttt ttttttttta actgtagttc cagtattata tgatgactat 900 

tgatttcctg gagaggtttt tttttttttt gagacagaat cttgctctgt tgcccaggct 960 

ggagtgcagt ggcgcggtct cggctcactg caagctctgc ctcccaggtt cacgccattc 1020 

tcctgcctca gcctcccgag tagctgggac tacaggcacc cgccaccaca tccggctaat 1080 

tttttgtatt tttagtagag acggggtttg accgtgttag ccaggatggt cttgatctcc 1140 

tgaccttgtg atccgcctgc ctcagcctcc caaagtgctg ggattacagg cttgggccac 1200 
cgcgcccagc caatgtccta gagagttttg tgatctgaat tctttatgta tatttgtagc ' 1260 

tatatttcat acaaagtgct ttaagtgtgg agagtcaatt aaacaccttt actcttagaa 1320 

atacggattc ggcagccttc agtgaatatt ggtttctctt tggtatgtca ataaaagttt 1380 
atccgtatgt cagaacggat ttgtggaaaa aaaaaaaaaa aaaaaaaaaa aa 1432 

<210> 152 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004577 
<400> 152 

tagaaatacg gattcggcag ccttcagtga atattggttt ctctttggta tgtcaataaa 60 

<210> 153 

<211> 1530 

<212> DNA 

<213> Homo sapiens 
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<300> 

<308> NM_004701 
<400> 153 

aatcctggaa caaggctaca gcgtcgaaga tccccagcgc tgcgggctcg gagagcagtc 60 

ctaacggcgc ctcgtacgct agtgtcctcc cttttcagtc cgcgtccctc cctgggccgg 120 

gctggcactc ttgccttccc cgtccctcat ggcgctgctc cgacgcccga cggtgtccag 180 

tgatttggag aatattgaca caggagttaa ttctaaagtt aagagtcatg tgactattag 240 

gcgaactgtt ttagaagaaa ttggaaatag agttacaacc agagcagcac aagtagctaa 300 

gaaagctcag aacaccaaag ttccagttca acccaccaaa acaacaaatg tcaacaaaca 360 

actgaaacct actgcttctg tcaaaccagt acagatggaa aagttggctc caaagggtcc 420 

ttctcccaca cctgaggatg tctccatgaa ggaagagaat ctctgccaag ctttttctga 480 

tgccttgctc tgcaaaatcg aggacattga taacgaagat tgggagaacc ctcagctctg 540 

cagtgactac gttaaggata tctatcagta tctcaggcag ctggaggttt tgcagtccat 600 

aaacccacat ttcttagatg gaagagatat aaatggacgc atgcgtgcca tcctagtgga 660 

ttggctggta caagtccact ccaagtttag gcttctgcag gagactctgt acatgtgcgt 720 

tggcattatg gatcgatttt tacaggttca gccagtttcc cggaagaagc ttcaattagt 780 

tgggattact gctctgctct tggcttccaa gtatgaggag atgttttctc caaatattga 840 

agactttgtt tacatcacag acaatgctta taccagttcc caaatccgag aaatggaaac 900 

tctaattttg aaagaattga aatttgagtt gggtcgaccc ttgccactac acttcttaag 960 

gcgagcatca aaagccgggg aggttgatgt tgaacagcac actttagcca agtatttgat 1020 

ggagctgact ctcatcgact atgatatggt gcattatcat ccttctaagg tagcagcagc 1080 

tgcttcctgc ttgtctcaga aggttctagg acaaggaaaa tggaacttaa agcagcagta 1140 

ttacacagga tacacagaga atgaagtatt ggaagtcatg cagcacatgg ccaagaatgt 1200 

ggtgaaagta aatgaaaact taactaaatt catcgccatc aagaataagt atgcaagcag 1260 

caaactcctg aagatcagca tgatccctca gctgaactca aaagccgtca aagaccttgc 1320 

ctccccactg ataggaaggt cctaggctgc cgtgggccct ggggatgtgt gcttcattgt 1380 

gccctttttc ttattggttt agaactcttg attttgtaca tagtcctctg gtctatctca 1440 

tgaaacctct tctcagacca gttttctaaa catatattga ggaaaaataa agcgattggt 1500 
ttttcttaag gtaaaaaaaa aaaaaaaaaa 1530 

<210> 154 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004701 
<400> 154 

agaactcttg attttgtaca tagtcctctg gtctatctca tgaaacctct tctcagacca 60 

<210> 155 
<211> 2536 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_004702 
<400> 155 

agcgggtgcg gggcgggacc ggcccggcct atatattggg ttggcgccgg cgccagctga 60 

gccgagcggt agctggtctg gcgaggtttt atacacctga aagaagagaa tgtcaagacg 120 

aagtagccgt ttacaagcta agcagcagcc ccagcccagc cagacggaat ccccccaaga 180 

agcccagata atccaggcca agaagaggaa aactacccag gatgtcaaaa gaagtctggc 240 

taaacatgtt aaaaaaggag agcagatatg ttcatgacaa acattttgaa gttctgcatt 300 

ctgacttgga accacagatg aggtccatac ttctagactg gcttttagag gtatgtgaag 360 

tatacacact tcatagggaa acattttatc ttgcacaaga cttttttgat agatttatgt 420 

tgacacaaaa ggatataaat aaaaatatgc ttcaactcat tggaattacc tcattattca 480 

ttgcttccaa acttgaggaa atctatgctc ctaaactcca agagtttgct tacgtcactg 540 

atggtgcttg cagtgaagag gatatcttaa ggatggaact cattatatta aaggctttaa 600 
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aatgggaact 


ttgtcctgta 


acaatcatct 


cctggctaaa 


tctctttctc 


caagttgatg 


660 


ctcttaaaga 


tgctcctaaa 


gttcttctac 


ctcagtattc 


tcaggaaaca 


ttcattcaaa 


720 


tagctcagct 


tttagatctg 


tgtattctag 


ccattgattc 


attagagttc 


cagtacagaa 


780 


tactgactgc 


tgctgccttg 


tgccatttta 


cctccattga 


agtggttaag 


aaagcctcag 


840 


gtttggagtg 


ggacagtatt 


tcagaatgtg 


tagattggat 


ggtacctttt 




900 


taaaaagtac 


tagtccagtg 


aagctgaaga 


cttttaagaa 


gattcctatg 


gaagacagac 


960 


ataatatcca 


gacacataca 


aactatttgg 


ctatgctgga 




tacataaaca 




ccttcagaaa 


agggggacag 


ttgtcaccag 


tgtgcaatgg 






1080 


agagcactga 


aaaaccacca 


ggaaaacact 


aaagaagata 


actaaaca 


caa C ttq C aa 


1 1 an 


ttcaccaaga 


ttgggtagaa 






aagttttaca 


4- 4- 

gaaagtagtg 




ctgtgafcfcga 




caattcacaa 


qttacactgc 


cauuccga u l 


ttaaaactta 




caattggcac 


taaagaatac 


atttaattat 


ttcctatgtt 




gaaacagcag 


i ^9n 


gacttgfctfca 






aggttactgg 


atagaagcca 


accacag c 




ataccatagc 


aatgtttttc 


ctttaatcca 




gtttatcttg 


9 ^ aac ag 9 


i ^nn 


aattttgtca 


ctggagtttt 


ggactggata 


agtgctacct 




ac aag ga 




acagtacttt 


gaatctagtt 


gttagattct 


caaaattcct 


acactcttga 






ttggttcttg 


aaaattaaat 


ttaaacttgt 


ttacaaaggt 


ttagttttgt 


t a at- 3 

gg g 




ctaatttatc 


tatagctgct 


atagcaagct 




ttgaatttct 






aaatttaatg 


ttttttaaac 


tagtttattt 


gcctfcgccat 


aacacatttt 


ttaactaata 


1740 


aggcttagat 


gaacatggtg 


ttcaacctgt 


gctctaaaca 


ataaaaatar 
y u y yy a y *- a, ~ 




1800 


ataaacaaga 


taaatgctgt 


ggctccttcc 


taactggggc 








ttggtaataa 


cctttttgta 


tatcacaatt 


tgggtgaaaa 


acttaaqtac 


cctttcaaac 


1920 


tatttatatg 


aggaagtcac 


tttactactc 




t 


ttttttttta 




atttagtgtg 


actaaggctt 


tatttatgtt 


tgtgaaactg 


ttaaggtcct 


ttctaaattc 


2040 


ctccattgtg 


agataaggac 


agtgtcaaag 


tgataaagct 


taacacttga 


cctaaacttc 


2100 


tattttctta 


aggaagaaga 


gtattaaata 


tatactgact 


cctagaaatc 


tatttattaa 


2160 


aaaaagacat 


gaaaacttgc 


tgtacatagg 


ctagctattt 


ctaaatattt 


taaattagct 


2220 


tttctaaaaa 


aaaaatccag 


cctcataaag 


tagattagaa 


aactagattg 


ctagtttatt 


2280 




atatgtgaat 


ctcttctccc 


tttgaagaaa 


ctatacattt 


attgttacgg 


2340 


tatgaagtct 


tctgtatagt 


ttgtttttaa 


actaatattt 


gtttcagtat 


tttgtctgaa 


2400 


aagaaaacac 


cactaattgt 


gtacatatgt 


attatataaa 


cttaacc'ttt 


taatactgtt 


2460 


tatttttagc 


ccattgttta 


aaaaataaaa 


gttaaaaaaa 


tttaactgct 


taaaagtaaa 


2520 


aaaaaaaaaa 


aaaaaa 2536 










<210> 156 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NMJD04702 












<400> 156 














gtttgtgaaa 


ctgttaaggt 


cctttctaaa 


ttcctccatt 


gtgagataag 


gacagtgtca 


60 


<210> 157 














<211> 1491 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_004710 












<400> 157 














gcggcggcgg 


cagcggcggc 


gacggcgaca 


tggagagcgg 


ggcctacggc 


gcggccaagg 


60 


cgggcggctc 


cttcgacctg 


cggcgcttcc 


tgacgcagcc 


gcaggtggtg 


gcgcgcgccg 


120 


tgtgcttggt 


cttcgccttg 


atcgtgttct 


cctgcatcta 


tggtgagggc 


tacagcaatg 


180 


cccacgagtc 


taagcagatg 


tactgcgtgt 


tcaaccgcaa 


cgaggatgcc 


tgccgctatg 


240 


gcagtgccat 


cggggtgctg 


gccttcctgg 


ectcggcctt 


cttcttggtg 


gtcgacgcgt 


300 


atttccccca 


gatcagcaac 


gccactgacc 


gcaagtacct 


ggtcattggt 


gacctgctct 


360 


tctcagctct ctggaccttc 


ctgtggtttg 


ttggtttctg 


cttcctcacc 


aaccagtggg 


420 
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cagtcaccaa cccgaaggac gtgctggtgg gggccgactc tgtgagggca gccatcacct 480 

tcagcttctt ttccatcttc tcctggggtg tgctggcctc cctggcctac cagcgctaca 540 

aggctggcgt ggacgacttc atccagaatt acgttgaccc cactccggac cccaacactg 600 

cctacgcctc ctacccaggt gcatctgtgg acaactacca acagccaccc ttcacccaga 660 

acgcggagac caccgagggc taccagccgc cccctgtgta ctgagcggcg gttagcgtgg 720 

gaagggggac agagagggcc ctcccctctg ccctggactt tcccatgagc ctcctggaac 780 

tgccagcccc tctctttcac ctgttccatc ctgtgcagct gacacacagc taaggagcct 84 0 

catagcctgg cgggggctgg cagagccaca ccccaagtgc ctgtgcccag agggcttcag 900 

tcagccgctc actcctccag ggcattttta ggaaagggtt ttcagctagt gtttttcctc 960 

gcttttaatg acctcagccc cgcctgcagt ggctagaagc cagcaggtgc ccatgtgcta 1020 

ctgacaagtg cctcagcttc cccccggccc gggtcaggcc gtgggagccg ctattatctg 1080 

cgttctctgc caaagactcg tgggggccat cacacctgcc ctgtgcagcg gagccggacc 114 0 

aggctcttgt gtcctcactc aggtttgctt cccctgtgcc cactgctgta tgatctgggg 1200 

gccaccaccc tgtgccggtg gcctctgggc tgcctcccgt ggtgtgaggg cggggctggt 1260 

gctcatggca cttcctcctt gctcccaccc ctggcagcag ggaagggctt tgcctgacaa 1320 

cacccagctt tatgtaaata ttctgcagtt gttacttagg aagcctgggg agggcagggg 1380 

tgccccatgg ctcccagact ctgtctgtgc cgagtgtatt ataaaatcgt gggggagatg 1440 
cccggcctgg gatgctgttt ggagacggaa taaatgtttt ctcattcagt a 1491 

<210> 158 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004710 
<400> 158 

ttgcctgaca acacccagct ttatgtaaat attctgcagt tgttacttag gaagcctggg 60 

<210> 159 
<211> 3324 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004856 
<400> 159 

gcagagcacc gcgccttagc cgcgaagttc tagttcttgc tgccggtcct aacgtcccgc 60 

agtcttcgcc agccagccgt cccgcatgcg cgtttgggcg gcgtggagcc tgctgccatg 120 

aagtcagcga gagctaagac accccggaaa cctaccgtga aaaaagggtc ccaaacgaac 180 

cttaaagacc cagttggggt atactgtagg gtgcgcccac tgggctttcc tgatcaagag 240 

tgttgcatag aagtgatcaa taatacaact gttcagcttc atactcctga gggctacaga 300 

ctcaaccgaa atggagacta taaggagact cagtattcat ttaaacaagt atttggcact 360 

cacaccaccc agaaggaact ctttgatgtt gtggctaatc ccttggtcaa tgacctcatt 420 

catggcaaaa atggtcttct ttttacatat ggtgtgacgg gaagtggaaa aactcacaca 480 

atgactggtt ctccagggga aggagggctg cttcctcgtt gtttggacat gatctttaac 540 

agtatagggt catttcaagc taaacgatat gttttcaaat ctaatgatag gaatagtatg 600 

gatatacagt gtgaggttga tgccttatta gaacgtcaga aaagagaagc tatgcccaat 660 

ccaaagactt cttctagcaa acgacaagta gatccagagt ttgcagatat gataactgta 720 

caagaattct gcaaagcaga agaggttgat gaagatagtg tctatggtgt atttgtctct 780 

tatattgaaa tatataataa ttacatatat gatctattgg aagaggtgcc gtttgatccc 840 

ataaaaccca aacctccaca atctaaattg cttcgtgaag ataagaacca taacatgtat 900 

gttgcaggat gtacagaagt tgaagtgaaa tctactgagg aggcttttga agttttctgg 960 

agaggccaga aaaagagacg tattgctaat acccatttga atcgtgagtc cagccgttcc 1020 

catagcgtgt tcaacattaa attagttcag gctcccttgg atgcagatgg agacaatgtc 1080 

ttacaggaaa aagaacaaat cactataagt cagttgtcct tggtagatct tgctggaagt 1140 

gaaagaacta accggaccag agcagaaggg aacagattac gtgaagctgg taatattaat 1200 

cagtcactaa tgacgctaag aacatgtatg gatgtcctaa gagagaacca aatgtatgga 1260 

actaacaaga tggttccata tcgagattca aagttaaccc atctgttcaa gaactacttt 1320 
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gatggggaag gaaaagtgcg gatgatcgtg tgtgtgaacc ccaaggctga agattatgaa 1380 

gaaaacttgc aagtcatgag atttgcggaa gtgactcaag aagttgaagt agcaagacct 1440 

gtagacaagg caatatgtgg tttaacgcct gggaggagat acagaaacca gcctcgaggt 1500 

ccagttggaa atgaaccatt ggttactgac gtggttttgc agagttttcc acctttgccg 1560 

tcatgcgaaa ttttggatat caacgatgag cagacacttc caaggctgat tgaagcctta 1620 

gagaaacgac ataacttacg acaaa'tgatg attgatgagt ttaacaaaca atctaatgct 1680 

tttaaagctt tgttacaaga atttgacaat gctgttttaa gtaaagaaaa ccacatgcaa 1740 

gggaaactaa atgaaaagga gaagatgatc tcaggacaga aattggaaat agaacgactg 1800 

gaaaagaaaa acaaaacttt agaatataag attgagattt tagagaaaac aactactatc 1860 

tatgaggaag ataaacgcaa tttgcaacag gaacttgaaa ctcagaacca gaaacttcag 1920 

cgacagtttt ctgacaaacg cagattagaa gccaggttgc aaggcatggt gacagaaacg 1980 

acaatgaagt gggagaaaga atgtgagcgt agagtggcag ccaaacagct ggagatgcag 2040 

aataaactct gggttaaaga tgaaaagctg aaacaactga aggctattgt tactgaacct 2100 

aaaactgaga agccagagag accctctcgg gagcgagatc gagaaaaagt tactcaaaga 2160 

tctgtttctc catcacctgt gcctttactc tttcaacctg atcagaacgc accaccaatt 2220 

cgtctccgac acagacgatc acgctctgca ggagacagat gggtagatca taagcccgcc 2280 

tctaacatgc aaactgaaac agtcatgcag ccacatgtcc ctcatgccat cacagtatct 2340 

gttgcaaatg aaaaggcact agctaagtgt gagaagtaca tgctgaccca ccaggaacta 24 00 

gcctccgatg gggagattga aactaaacta attaagggtg atatttataa aacaaggggt 2460 

ggtggacaat ctgttcagtt tactgatatt gagactttaa agcaagaatc accaaatggt 2520 

agtcgaaaac gaagatcttc cacagtagca cctgcccaac cagatggtgc agagtctgaa 2580 

tggaccgatg tagaaacaag gtgttctgtg gctgtggaga tgagagcagg atcccagctg 2640 

ggacctggat atcagcatca cgcacaaccc aagcgcaaaa agccatgaac tgacagtccc 2700 

agtactgaaa gaacattttc atttgtgtgg atgatttctc gaaagccatg ccagaagcag 27 60 

tcttccaggt catcttgtag aactccagct ttgttgaaaa tcacggacot cagctacatc 2820 

atacactgac ccagagcaaa gctttcccta tggttccaaa gacaactagt attcaacaaa 2880 

ccttgtatag tatatgtttt gccatattta atattaatag cagaggaaga ctcctttttt 2940 

catcactgta tgaatttttt ataatgtttt tttaaaatat atttcatgta tacttataaa 3000 

ctaattcaca caagtgtttg tcttagatga ttaaggaaga ctatatctag atcatgtctg 3060 

attttttatt gtgacttctc cagccctggt ctgaatttct taaggtttta taaacaaatg 3120 

ctgctattta ttagctgcaa gaatgcactt tagaactatt tgacaattca gactttcaaa 3180 

ataaagatgt aaatgactgg ccaataataa ccattttagg aaggtgtttt gaattctgta 3240 

tgtatatatt cactttctga catttagata tgccaaaaga attaaaatca aaagcactaa 3300 

gaaataaaaa aaaaaaaaaa aaaa 3324 

<210> 160 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004856 
<400> 160 

caaagctttc cctatggttc aaagacaact agtattcaac aaaccttgta tagtgtatgt 60 

<210> 161 
<211> 1536 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004900 
<400> 161 

acagagcttc aaaaaaagag cgggacaggg acaagcgtat 
ccacagatca gaaatccgat ggagcggatg tatcgagaca 
aacgaaccca tcctctatgg tcggagctac acttggctgt 
aggggccgct caaatctcct ttgggacaca ggggtctttc 
cctcagtacc acgcagaaat gtgcttcctc tcttggttct 
tacaagtgtt tccagatcac ctggtttgta tcctggaccc 



ctaagaggct gaacatgaat 60 

cattctacga caactttgaa 120 

gctatgaagt gaaaataaag 180 

gaggccaggt gtatttcaag 240 

gtggcaacca gctgcctgct 300 

cctgcccgga ctgtgtggcg 360 
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aagctggccg 
ctctactact 
gcccgcgtga 
aatgaaggtc 
cgcacgctaa 
aataatgacc 
ctggacaatg 
aagaatcttc 
ccttctttgc 
ccctgcttct 
gtgagactgc 
ctgcaaatgc 
tactgctggg 
gaggagcaca 
tgaaggatgg 
gatcttcttc 
ccagcaaagc 
taattgaaaa 
agaatagttt 
cattaagaat 



aattcctgtc 
actgggaaag 
cgatcatgga 
agcaattcat 
aggagattct 
ctttggtcct 
gcacctgggt 
tctgtggctt 
agttggaccc 
cctggggctg 
gcatcttcgc 
tgcgggatgc 
acacctttgt 
gccaagccct 
gcctcagtct 
caagaaatgc 
aatgtgctcc 
tttctcttat 
tcaatgtatt 
cttccataat 



tgagcacccc 
agattaccga 
ctatgaagaa 
gccttggtac 
cagatacctg 
tcgacggcgc 
cctgatggac 
ttacggccgc 
ggcccagatc 
tgccggggaa 
tgcccgcatc 
tggggcccaa 
gtaccgccag 
gagtgggagg 
ctaaggaagg 
aaacagaccg 
tgatcaagta 
gttccaagtg 
aatgaagtga 
tgtttccaca 



aatgtcaccc tgaccatctc 
agggcgctct gcaggctgag 
tttgcatact gctgggaaaa 
aaattcgatg aaaattatgc 
atggatccag acacattcac 
cagacctact tgtgctatga 
cagcacatgg gctttctatg 
catgcggagc tgcgcttctt 
tacagggtca cttggttcat 
gtgcgtgcgt tccttcagga 
tatgattacg accccctata 
gtctccatca tgacctacga 
ggatgtccct tccagccctg 
ctgcgggcca ttctccagaa 
cagagacctg ggttgagcag 
ttcaccacca tctccagctg 
gattttttaa aaatcagagt 
tacaagagta agattatgct 
ttaattggct ccatatttag 
aacact 1536 



tgccgcccgc 
tcaggcagga 
ctttgtgtac 
attcctgcac 
tttcaacttt 
ggtggagcgc 
caacgaggct 
ggacctggtt 
ctcctggagc 
gaacacacac 
taaggaggcg 
tgagtttgag 
ggatggacta 
tcagggaaac 
cagaataaaa 
ctcacagaca 
caattaattt 
caatattccc 
actaataaaa 



420- 

480' 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 



<210> 162 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004900 
<400> 162 

tgctcacaga caccagcaaa gcaatgtgct cctgatcaag tagatttttt aaaaatcaga 

<210> 163 
<211> 1722 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> NM_004988 



<400> 163 
cgtagagttc 
acaggccaac 
ctgcctgtgg 
agtcatcatg 
ccaacaagag 
ggtcctgggc 
tcagggagcc 
gggttccagc 
ccgagcagta 
agccagggag 
ctgttttcct 
cgtgaaggaa 
ctatgatggc 
cctggtcatg 
gagtgtgatg 
gctcacccaa 
tcccgcacgc 
agtccttgag 
tgaagcagct 
tgggaggggg 



ggccgaagga 
ccagaggaca 
gtcttcattg 
tctcttgagc 
gccctgggcc 
accctggagg 
tccgcctttc 
agccgtgaag 
atcactaaga 
ccagtcacaa 
gagatcttcg 
gcagacccca 
ctgctgggtg 
attgcaatgg 
gaggtgtatg 
gatttggtgc 
tatgagttcc 
tatgtgatca 
ttgagagagg 
actgggccag 



acctgaccca 
ggattccctg 
cccagctcct 
agaggagtct 
tggtgtgtgt 
aggtgcccac 
ccactaccat 
aggaggggcc 
aggtggctga 
aggcagaaat 
gcaaagcctc 
ccggccactc 
ataatcagat 
agggcggcca 
atgggaggga 
aggaaaagta 
tgtggggtcc 
aggtcagtgc 
aggaagaggg 
tgcaccttcc 



ggctctgtga 
gaggccacag 
gcccacactc 
gcactgcaag 
gcaggctgcc 
tgctgggtca 
caacttcact 
aagcacctct 
tttggttggt 
gctggagagt 
tgagtccttg 
ctatgtcctt 
catgcccaag 
tgctcctgag 
gcacagtgcc 
cctggagtac 
aagggccctt 
aagagttcgc 
agtctgagca 
agggccgcgt 



ggaggcaagg 
aggagcacca 
ctgcctgctg 
cctgaggaag 
gcctcctcct 
acagatcctc 
cgacagaggc 
tgtatcctgg 
tttctgctcc 
gtcatcaaaa 
cagctggtct 
gtcacctgcc 
acaggcttcc 
gaggaaatct 
tatggggagc 
cggcaggtgc 
gctgaaacca 
tttttcttcc 
tgagttgcag 
ccagcagctt 



ttttcagggg 
aggagaagat 
ccctgacgag 
cccttgaggc 
cctctcctct 
cccagagtcc 
aacccagtga 
agtccttgtt- 
tcaaatatcg 
attacaagca 
ttggcattga 
taggtctctc 
tgataattgt 
gggaggagct 
ccaggaagct 
cggacagtga 
gctatgtgaa 
catccctgcg 
ccagggccag 
cccctgcctc 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 
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gtgtgacatg 


aggcccattc 


ttcactctga 


agagagcggt 


cagtgttctc 


agtagtaggt 


1260 


ttctgttcta 


ttgggtgact 


tggagattta 


tctttgttct 


cttttggaat 


tgttcaaatg 


1320 


ttttttttta 


agggatggtt 


gaatgaactt 


cagcatccaa 


gtttatgaat 


gacagcagtc 


1380 


acacagttct 


gtgtatatag 


tttaagggta 


agagtcttgt 


gttttattca 


gattgggaaa 


1440 


tccattctat 


tttgtgaatt 


gggataataa 


cagcagtgga 


ataagtactt 


agaaatgtga 


1500 


aaaatgagca 


gtaaaataga 


tgagataaag 


aactaaagaa 


attaagagat 


agtcaattct 


1560 


tgccttatac 


ctcagtctat 


tctgtaaaat 


ttttaaagat 


atatgcatac 


ctggatttcc 


1620 


ttggcttctt 


tgagaatgta 


agagaaatta 


aatctgaata 


aagaattctt 


cctgttaaaa 


1680 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 




aa 1722 






<210> 164 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_004988 












<400> 164 














cagattggga 


aatccattct 


attttgtgaa 


ttgggataat 


aacagcagtg 

g g g 


gaataagtac 


60 


<210> 165 














<211> 2334 














<212> DNA 














<213> Homo sapiens 












<300> 














<308> NM_004994 












<400> 165 














agacacctct 


gccctcacca 


tgagcctctg 


gcagcccctg 


gtcctggtgc 


tcctggtgct 


60 


gggctgctgc 


tttgctgccc 


ccagacagcg 


ccagtccacc 


cttgtgctct 


tccctggaga 


120 


cctgagaacc 


aatctcaccg 


acaggcagct 


ggcagaggaa 


tacctgtacc 


gctatggtta 


180 


cactcgggtg 


gcagagatgc 


gtggagagtc 


gaaatctctg 


gggcctgcgc 


tgctgcttct 


240 


ccagaagcaa 


ctgtccctgc 


ccgagaccgg 


tgagctggat 


agcgccacgc 


tgaaggccat 


300 


gcgaacccca 


cggtgcgggg 


tcccagacct 


gggcagattc 


caaacctttg 


agggcgacct 


360 


caagtggcac 


caccacaaca 


tcacctattg 


gatccaaaac 


tactcggaag 


acttgccgcg 


420 


ggcggtgatt 


gacgacgcct 


ttgcccgcgc 


cttcgcactg 


tggagcgcgg 


tgacgccgct 


480 


caccttcact 


cgcgtgtaca 


gccgggacgc 


agacatcgtc 


atccagtttg 


gtgtcgcgga 


540 


gcacggagac 


gggtatccct 


tcgacgggaa 


ggacgggctc 


ctggcacacg 


cctttcctcc 


600 


tggccccggc 


attcagggag 


acgcccattt 


cgacgatgac 


gagttgtggt 


ccctgggcaa 


660 


gggcgtcgtg 


gttccaactc 


ggtttggaaa 


cgcagatggc 


gcggcctgcc 


acttcccctt 


720 


catcttcgag 


ggccgctcct 


actctgcctg 


caccaccgac 


ggtcgctccg 


acggcttgcc 


780 


ctggtgcagt 


accacggcca 


actacgacac 


cgacgaccgg 


tttggcttct 


gccccagcga 


840 


gagactctac 


acccgggacg 


gcaatgctga 


tgggaaaccc 


tgccagtttc 


cattcatctt 


900 


ccaaggccaa 


tcctactccg 


cctgcaccac 


ggacggtcgc 


tccgacggct 


accgctggtg 


960 


cgccaccacc 


gccaactacg 


accgggacaa 


gctcttcggc 


ttctgcccga 


cccgagctga 


1020 


ctcgacggtg 


atggggggca 


actcggcggg 


ggagctgtgc 


gtcttcccct 


tcactttcct 


1080 


gggtaaggag 


tactcgacct 


gtaccagcga 


gggccgcgga 


gatgggcgcc 


tctggtgcgc 


1140 


taccacctcg 


aactttgaca 


gcgacaagaa 


gtggggcttc 


tgcccggacc 


aaggatacag 


1200 


tttgttcctc 


gtggcggcgc 


atgagttcgg 


ccacgcgctg 


ggcttagatc 


attcctcagt 


1260 


gccggaggcg 


ctcatgtacc 


ctatgtaccg 


cttcactgag 


gggcccccct 


tgcataagga 


1320 


cgacgtgaat 


ggcatccggc 


acctctatgg 










aaccaccacc 


acaccgcagc 


ccacggctcc 


cccgacggtc 


tgccccaccg 


gaccccccac 


1440 


tgtccacccc 


tcagagcgcc 


ccacagctgg 


ccccacaggt 


cccccctcag 


ctggccccac 


1500 


aggtcccccc 


actgctggcc 


cttctacggc 


cactactgtg 


cctttgagtc 


cggtggacga 


1560 


tgcctgcaac 


gtgaacatct 


tcgacgccat 


cgcggagatt 


gggaaccagc 


tgtatttgtt 


1620 


caaggatggg 


aagtactggc 


gattctctga 


gggcaggggg 


agccggccgc 


agggcccctt 


1680 


ccttatcgcc 


gacaagtggc 


ccgcgctgcc 


ccgcaagctg 


gactcggtct 


ttgaggagcc 


1740 


gctctccaag 


aagcttttct 


tcttctctgg 


gcgccaggtg 


tgggtgtaca 


caggcgcgtc 


1800 


ggtgctgggc 


ccgaggcgtc 


tggacaagct 


gggcctggga 


gccgacgtgg 


cccaggtgac 


1860 
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cggggccctc 


cggagtggca 


gggggaagat 


gctgctgttc 


agegggegge 


gectctggag 


1920 


gttcgacgtg 


aaggegcaga 


tggtggatcc 


ccggagcgcc 


agcgaggtgg 


accggatgtt 


1980 


ccccggggtg 


cctttggaca 


cgcacgacgt 


cttccagtac 


cgagagaaag 


cctatttctg 


2040 


ccaggaccgc 


ttctactggc 


gcgtgagttc 


ccggagtgag 


ttgaaccagg 


tggaccaagt 


2100 


gggctacgtg 


acctatgaca 


tectgeagtg 


ccctgaggac 


tagggctccc 


gtcctgcttt 


2160 


gcagtgccat 


gtaaatcccc 


actgggacca 


accctgggga 


aggagecagt 


ttgeeggata 


2220 


caaactggta 


ttctgttctg 


gaggaaaggg 


aggagtggag 


gtgggctggg 


ccctctcttc 


2280 


tcacctttgt 


tttttgttgg 


agtgtttcta 


ataaacttgg 


attctctaac 


cttt 2334 




<210> 166 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_004 994 












<400> 166 














ggccctctct 


tctcaccttt 


gttttttgtt 


ggagtgtttc 


taataaactt 


ggattctcta 


60 


<210> 167 














<211> 5329 














<212> DNA 














<213> Homo 


sapiens 












<220> 














<221> Modified base 












<222> 1 . . . 


, 5329 












<223> n = a, c,g, or t 












<300> 














<308> NM_005063 












<400> 167 














gtggtgtcgg 


tgteggcage 


atccccggcg 


ccctgctgcg 


gtegceggag 


ccctcggcct 


60 


ctgttctcct 


ccccctcccg 


cccttacctc 


cacgcgggac 


cgcccgcgcc 


agtcaactcc 


120 


tegcactttg 


cccctgcttg 


geageggata 


aaagggggct 


gaggaaatac 


cggacacgtc 


180 


cacccgttgc 


cagctctagc 


ctttaaattc 


ccggctcggg 


acctccacgc 


acegggctag 


240 


cgccgacaac 


cagctagegt 


gcaaggcgcc 


gcggctcagc 


gcgtaccggc 


gggcttcgaa 


300 


accgcagtcc 


tccggcgacc 


ccgaactccg 


ctccggagcc 


tcagccccct 


ggaaagtgat 


360 


cccggcatcg 


gagagecaag 


atgccggccc 


acttgetgea 


ggacgatatc 


tctagctcct 


420 


ataccaccac 


caccaccatt 


acagcgcctc 


cctccagggt 


ectgeagaat 


ggaggagata 


480 


agttggagac 


gatgcccctc 


tacttggaag 


acgacattcg 


ccctgatata 


aaagatgata 


540 


tatatgaccc 


cacctacaag 


gataaggaag 


gcccaagccc 


caaggttgaa 


tatgtctgga 


600 


gaaacatcat 


ccttatgtct 


ctgctacact 


tgggagcect 


gtatgggatc 


actttgattc 


660 


ctacctgcaa 


gttctacacc 


tggctttggg 


gggtattcta 


ctattttgtc 


agtgccctgg 


720 


gcataacagc 


aggagctcat 


cgtctgtgga 


gccaccgctc 


ttacaaagct 


cggctgcccc 


780 


taeggctett 


tctgatcatt 


gccaacacaa 


tggcattcca 


gaatgatgtc 


tatgaatggg 


840 


ctcgtgacca 


ccgtgcccac 


cacaagtttt 


cagaaacaca 


tgetgatect 


cataattccc 


900 


gacgtggctt 


tttcttctct 




ggctget tgt 


gcgcaaacac 


ccagctgtca 


960 


aagagaaggg 


gagtaegcta 


gacttgtctg 


acctagaagc 


tgagaaactg 


gtgatgttcc 


1020 


agaggaggta 


ctacaaacct 


ggcttgctgc 


tgatgtgctt 


catcctgccc 


acgcttgtgc 


1080 


cctggtattt 


ctggggtgaa 


acttttcaaa 


acagtgtgtt 


cgttgccact 


ttcttgegat 


1140 


atgctgtggt 


gettaatgee 


acctggctgg 


tgaacagtgc 


tgcccacctc 


tteggatate 


1200 


gtccttatga 


caagaacatt 


agcccccggg 


agaatatcct 


ggtttcactt 


ggagctgtgg 


1260 


gtgagggctt 


ccacaactac 


caccactcct 


ttccctatga 


ctactctgcc 


agtgagtacc 


1320 


gctggcacat 


caacttcacc 


acattcttca 


ttgattgeat 


ggccgccctc 


ggtctggcct 


1380 


atgaccggaa 


gaaagtctcc 


aaggccgcca 


tcttggccag 


gattaaaaga 


aceggagatg 


1440 


gaaactacaa 


gagtggctga 


gtttggggtc 


cctcaggttc 


ctttttcaaa 


aaccagccag 


1500 


gcagaggttt 


taatgtctgt 


ttattaacta 


ctgaataatg 


ctaccaggat 


gctaaagatg 


1560 


atgatgttaa 


cccattccag 


tacagtattc 


ttttaaaatt 


caaaagtatt 


gaaagecaac 


1620 
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aactctgcct 
ggcccattgt 
gcagctggtc 
ctaaaacgaa 
agtaggtggc 
ttcagcccag 
aaagcnaggt 
cgaaggaacc 
ccaccacata 
ggcaatgcta 
taagtttgta 
taacaaggag 
tggtaaaaac 
tcctctttct 
ctgtggcatt 
aaataaaata 
ttccaaagag 
atttcctctc 
ttctaagagc 
tactgagtga 
tccagtttct 
ttcactcatt 
aactgcctgg 
taccctgtct 
ccctggacag 
agctccctcc 
agagctagct 
cctgccacac 
agaaagcctc 
ggccagccac 
agggcatgct 
cagctggcag 
gggcaagaac 
ctgcaaagct 
tctgttcagc 
tggaaaaaaa 
catcaaagca 
cagtccactg 
gtagaggcta 
agtagaaaag 
cccttggaaa 
tatataaaca 
atctgtacca 
gtgtggctgt 
tctaaagctc 
tttctaacag 
tttctaatca 
cctgagaccc 
cacgatttgc 
cttaagtgcc 
tttttttttt 
gtttgctagt 
gttaaaaaag 
cactgacttg 
tctgttaaaa 
aatgcttttt 
tttttataat 
ttagtctata 
gccagacaaa 
cttttctgtg 



ttatgatgct 
cctccttttc 
agtctttgct 
tggtctttgc 
ttgagctaga 
gcttttgcta 
aaattgtcgg 
tctcgccatg 
gcatgcttcc 
attcaatgcc 
gtaaaagtgg 
atttcttagt 
agcagctcat 
gctctgccat 
tccaaacatc 
tatatacata 
ggatgtttgg 
cactgctgga 
gaacttcaca 
gttattggga 
ccttgaaatg 
tttttttgag 
ggggcagggt 
gtcccttttc 
gcagcctcct 
tgcacacaga 
gccactttca 
tgatggctca 
tctctccctc 
agagcaaaag 
gaatgccccc 
tgcaatgctt 
ccaagtgcct 
ttggctgaga 
aaactaacca 
caaaaacaga 
gaagtctggc 
ttgagcccca 
caggggttag 
gatctaaggg 
tgtctgctgg 
ataaaaacaa 
tgaccctaca 
gggtgtgggt 
aattcaagtg 
tccttgcttt 
cagtgttgca 
tactgaagtg 
aggattccct 
cacatttgat 
tttgggttaa 
atcttgggtg 
cgaggtggcc 
ctgtgtgacc 
tggggataat 
agaaagcatt 
aggaatgtcc 
tggttctcca 
atttgagaat 
attgggtggg 



aagctgatat 
actttaatcg 
cagtgtccag 
tccagataac 
gataaaacag 
gatggaatgg 
gggagagagt 
atcagacata 
tttctctcct 
gcaacatata 
tctctgctgg 
tcatatatca 
agaattttga 
cttcaggata 
caaaaaaagg 
tatacattgc 
aaaaaactct 
catgagatgg 
taggaaggga 
gtcttaataa 
agtaaaaact 
gagaaggggg 
taggaatctc 
tttgaccaga 
ttgtgtgtat 
atgctcaggg 
cgtggcctcc 
agacaaggct 
cctctctcat 
agggtttatt 
tgcttacttg 
gtagaagtag 
cacctcgaaa 
cctgggattt 
gcattcccta 
atttgagaac 
tttgctctat 
gtgctggaag 
cctggactaa 
aagatcactg 
tatttctaat 
ctttcacttc 
taaggctgga 
gggagtgtgt 
acacattaat 
ggggggtgtg 
tatgagcctg 
gctggtagaa 
tctgggcttc 
ggagggtgga 
aagatggttg 
tattctctgt 
atgttatgct 
ctgggcaagt 
aatactgacc 
ttgggatcct 
accatgaact 
agaaactgaa 
acataaacaa 
attttttccc 



tatttcttct 
ccctcctttc 
cttccaaagc 
tctctttcct 
aatcttctgg 
aaaagcaact 
tagcatgtat 
cagctgccta 
ggctcggggt 
gttgaggccg 
ggaagggttt 
agaagtcttg 
gtattccatg 
ttggttcttc 
gaaggattta 
ttagaacgtt 
gaaggagagg 
agaggctgag 
tctgagaaca 
actagatatt 
agaaggcttc 
tctctgttaa 
ttcactaccc 
tctttctctt 
tcagaggcag 
tcactgaacc 
gcagtgtctc 
ggcaaaccct 
gagaagccaa 
ttcagtcccc 
gtgagggtgc 
gaggaaacag 
ggaggccctg 
gagataccac 
cagcctaggg 
cttggaccac 
taagattgga 
ggaggaaggc 
aggcatcctt 
tagtttagtt 
tccacaggtc 
ttcctattgt 
tggcacctca 
ctgctgagta 
gataaactca 
ctggcaactt 
ccctcactcc 
aaaggggcct 
attctggaaa 
aataatttga 
tagcatttaa 
aagtgtagct 
ggtggttgcc 
cacttaacta 
tacctcaaag 
tcagcacagg 
tgatacgtcc 
tgaatccatt 
cgcattgcca 
tttttatgtg 



cttatcctct 
ccttattgcc 
ctagacaacc 
tgagctgttg 
gtagtcccct 
tcatttgaca 
gaatgtaagg 
cctaatgagg 
aaaaagtggc 
aggataaaga 
tcttttcttt 
aagttgggtg 
agctgctcat 
ccctcatagt 
aggaggtgaa 
aaactattag 
aggaattagt 
ggacaggatc 
cgttcagggg 
aggtccattc 
tctccacagt 
catctagcct 
tgattcttga 
ccctgaacgt 
tgatgacttg 
actgcttctc 
cacctacacc 
cccagaaaca 
gcgctcatgt 
tctctctggg 
cccgcctgag 
ttctcactgg 
ttccctggag 
aaaccctgct 
cagacaatag 
tcctgtccct 
aatgtacact 
ctttcttctg 
gtctttgagc 
ctgttgacct 
atcagatgcc 
aatcgtgtgc 
ggctgagggc 
aggaacacga 
gatctgatca 
agctcaggtg 
ctctgcagaa 
gagtggagga 
cttttgttag 
atgtatttga 
aatggaaaat 
caaataggtc 
agggcctcca 
taaggtgcct 
ggcagttttg 
aattctcaag 
gtgtgtccca 
ggagaagcgg 
cggaaacata 
ggatatagta 



ctctcttcta 
tcccaggcaa 
tttctgtagc 
tgagctttga 
gttgattatc 
caaagcttct 
atgagggaag 
acttcaagcc 
tgcggtgttt 
aaagacattt 
ttttctttaa 
tttccagaat . 
tacagttctt 
aataagatgg 
gtcgggtcaa 
agtatttccc 
tgggatgcca 
tataggcagc 
ttgagaaggt 
attaattagt 
gttgtgcccc 
aaagtataca 
ttcctggctc 
tttcttcttt 
ctgtccaggc 
ttttgaaagt 
cctgtgctcc 
tctctggccc 
tgagccagtg 
tcagaaccag 
tcagtgctct 
gaagaagcaa 
tcagggtgaa 
gaacacagtg 
tatagaagtc 
gtagctcagt 
accaaacact 
tgttaattgc 
tattcacctc 
gtgcacctac 
tgcttgataa 
catggatctg 
cccaatgtat 
ttttcaagat 
agagtccgga 
ccttacatct 
tccctttgca 
ttatcagtat 
ggctgctttt 
tttataagtt 
■tttctccttg 
atcatgaaag 
accactgtgc 
cagttttcct 
aggcatgact 
acctgagtat 
gatgctgtca 
tggataacta 
cagaggatgc 
gttacttgtg 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
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acaagaataa ttttggaata atttctatta atatcaactc tgaagctaat tgtactaatc 5280 

tgagattgtg tttgttcata ataaaagtga agtgaatctg attgcactg 5329 

<210> 168 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005063 
<400> 168 

aataatgcta ccaggatgct aaagatgatg atgttaaccc attccagtac agtattcttt 60 

<210> 169 

<211> 634 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005101 
<400> 169 

cggctgagag gcagcgaact catctttgcc agtacaggag cttgtgccgt ggcccacagc 60 

ccacagccca cagccatggg ctgggacctg acggtgaaga tgctggcggg caacgaattc 120 

caggtgtccc tgagcagctc catgtcggtg tcagagctga aggcgcagat cacccagaag 180 

attggcgtgc acgccttcca gcagcgtctg gctgtccacc cgagcggtgt ggcgctgcag 240 

gacagggtcc cccttgccag ccagggcctg ggccctggca gcacggtcct gctggtggtg 300 

gacaaatgcg acgaacctct gagcatcctg gtgaggaata acaagggccg cagcagcacc 360 

tacgaggtcc ggctgacgca gaccgtggcc cacctgaagc agcaagtgag cgggctggag 420 

ggtgtgcagg acgacctgtt ctggctgacc ttcgagggga agcccctgga ggaccagctc 480 

ccgctggggg agtacggcct caagcccctg agcaccgtgt tcatgaatct gcgcctgcgg 540 

ggaggcggca cagagcctgg cgggcggagc taagggcctc caccagcatc cgagcaggat 600 
caagggccgg aaataaaggc tgttgtaaga gaat 634 

<210> 170 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005101 
<400> 170 

tggtggtgga caaatgcgac gaacctctga gcatcctggt gaggaataac aagggccgca 60 

<210> 171 
<211> 1339 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_005139 
<400> 171 

gaattccgat tagtgtgatc tcagctcaag gcaaaggtgg gatatcatgg catctatctg 60 
ggttggacac cgaggaacag taagagatta tccagacttt agcccatcag tggatgctga 120 
agctattcag aaagcaatca gaggaattgg aactgatgag aaaatgctca tcagcattct 180 
gactgagagg tcaaatgcac agcggcagct gattgttaag gaatatcaag cagcatatgg 240 
aaaggagctg aaagatgact tgaagggtga tctctctggc cactttgagc atctcatggt 300 
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ggccctagtg 


actccaccag 


cagtctttga 


tgcaaagcag 


ct a 

g a 


cc gaaggg 




cgcgggaaca 


aacgaagatg 


ccttgattga 


aatcttaact 




gc ggcaaa 


420 


gaaggatatc 


tctcaagcct 


attatacagt 


atacaagaag 




9 9 


„ 


ttccgaaaca 


tctggtgact 


tccggaaagc 


tctgttgact 


ttggcagatg 


gcagaagaga 


540 


tgaaagtctg 


aaagtggatg 


agcatctggc 


caaacaagat 


gcccagattc 


a aaagc 


600 


tggtgagaac 


agatggggca 


cggatgaaga 


caaattcact 


gagatcctgt 




660 


ctttcctcaa 


ttaaaactaa 


catfctgatga 


atacagaaat 








ggacagcata 


aaaggagaat 


tatctgggca 


ttttgaagac 


ttactgttgg 


ccatagttaa 


7Rf) 


ttgtgtgagg 


aacacgccgg 


cctttttagc 






tgaagggtat 


840 


tggaactgat 


gagtttactc 


tgaaccgaat 


aatggtgtcc 


agatcagaaa 


ttgacctttt 


900 


ggacattcga 


acagagttca 


agaagcatta 


tggctattcc 


ctatattcag 


caattaaatc 


960 


ggatacttct 


ggagactatg 


aaatcacact 


cttaaaaatc 


tgtggtggag 


atgactgaac 


1020 




atctccaaag 


gtccacgatg 


ggctttccca 


acagctccac 


cttacttctt 


1080 


ctcatactat 


ttaagagaac 


aagcaaatat 


aaacagcaac 


ttgtgttcct 


aacaggaatt 


1140 


ttcattgttc 


tataacaaca 


acaacaaaag 


cgattattat 


tttagagcat 


ctcatttata 


1200 


atgtagcagc 


tcataaatga 


aattgaaaat 


ggtattaaag 


atctgcaact 


actatccaac 


1260 


ttatatttct 


gctttcaaag 


ttaagaatct 


ttatagttct 


actccattaa 


atataaagca 


1320 


agataataaa 


acggaattc 


1339 








<210> 172 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_005139 












<400> 172 














ttcagcaatt 


aaatcggata 


cttctggaga 


ctatgaaatc 


acactcttaa 


aaatctgtgg 


60 


<210> 173 














<211> 1582 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_005165 












<400> 173 














ccgagctgtg 


cttgtggctg 




ctggctgcgc 


acagggagct 


gtcaccatgc 


60 


ctcactcgta 


cccagccctt 




agaagaagga 




attgccctgc 


120 


ggattgtagc 


cccgggcaaa 


.. . 

gg c gg 


c gcgga ga 


g c g aggc 


ageatggeca 


180 


agcggctgag 


ccaaattggg 


g ggaaaaca 


cagaggagaa 


ccgccggctg 


taccgccagg 


240 


tcctgttcag 


tgctgatgac 


g g gaaaa 




aggcgtcatt 


ttcttccatg 


300 


agaccctcta 


ccagaaagat 


y gg g 


ttc^ctt ^ 


ccgaaccatc 


caggataagg 


360 


gcatcgtcgt 


gggcatcaag 




gtgtggtgcc 


c age ggg 


actgatggag 


420 


aaaccaccac 


tcaagggctg 


gatgggctct 






aagaaggatg 




gtgctgactt 


tgccaagtgg 


cgctgtgtgc 


tgaaaatcag 


tqaqcqtaca 


ccctctgcac 


540 


ttgccattct 


ggagaacgcc 


aacgtgctgg 




g gc 


cagcagaa g 




gcattgtgcc 


tattgtggaa 




tgcctgatgg 




etcaaaegtt 


ccn 


gtcagtatgt 


tacagagaag 


gtcttggctg 


ctgtgtacaa 


ggccctgagt 




79n 


tatacctgga 


ggggaccctg 


ctcaagccca 


acatggtgac 


cccgggccat 


gcctgtccca 


780 


tcaagtatac 


cccagaggag 


attgccatgg 


caactgtcac 


tgccctgcgt 


cgcactgtgc 


840 


ccccagctgt 


ccoaggagtg 


accttcctgt 


ctgggggtca 


gagegaagaa 


gaggcatcat 


900 


tcaacctcaa 


tgccatcaac 


cgctgccccc 


ttccccgacc 


ctgggcgctt 


accttctcct 


960 


atgggcgtgc 


cctgcaagcc 


tctgcactca 


atgcctggcg 


agggcaaegg 


gaeaatgetg 


1020' 


gggctgccac 


tgaggagttc 


atcaagcggg 


ctgaggtgaa 


tgggcttgea 


gcccagggca 


1080 


agtatgaagg 


cagtggagaa 


gatggtggag 


cagcagcaca 


gtcactctac 


attgccaacc 


1140 


atgcctactg 


agtatccact 


ccataccaca 


gcccttggcc 


cagccatctg 


cacccacttt 


1200 


tgcttgtagt 


catggccagg 


gccaaatagc 


tatgcagagc 


agagatgect 


tcacctggca 


1260 


ccaacttgtc 


ttcctttctc 


tcttcccttc 


ccctctctca 


ttgctgcacc 


tgggaccata 


1320 



79 



WO 2006/084272 PCT/LS2006/004280 



ggatgggagg atagggagcc cctcatgact gagggcagaa gaaattgcta gaagtcagaa 1380 

caggatggct gggtctcccc ctacctcttc cagctcccac aattttccca tgatgaggta 1440 

gcttctccct gggctctcct tcttgcctgc cctgtctcct gggatcagag ggtagtacag 1500 

aagccctgac tcatgccttg agtacatacc atacagcaaa taaatggtag caaaacaaaa 1560 
aaaaaaaaaa aaaaaaaaaa aa 1582 

<210> 174 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005165 
<400> 174 

gagggtagta cagaagccct gactcatgcc ttgagtacat accatacagc aaataaatgg 60 

<210> 175 
<211> 451 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_005213 
<400> 175 

acttccctgt tcactttggt tccagcatcc tgtccagcaa agaagcaatc agccaaaatg 60 
atacctggag gcttatctga ggccaaaccc gccactccag aaatccagga gattgttgat 120 
aaggttaaac cacagcttga agaaaaaaca aatgagactt atggaaaatt ggaagctgtg 180 
cagtataaaa ctcaagttgt tgctggaaca aattactaca ttaaggtacg agcaggtgat 240 
aataaatata tgcacttgaa agtattcaaa agtcttcccg gacaaaatga ggacttggta 300 
cttactggat accaggttga caaaaacaag gatgacgagc tgacgggctt ttagcagcat 360 
gtacccaaag tgttctgatt ccttcaactg gctactgagt catgatcctt gctgataaat 420 
ataaccatca ataaagaagc attcttttcc a 451 

<210> 176 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005213 



<400> 176 

aactggctac tgagtcatga tccttgctga taaatataac catcaataaa gaagcattct 60 

<210> 177 

<211> 366 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005218 
<400> 177 

gtcagctcag cctccaaagg agccagcctc tccccagttc ctgaaatcct gagtgttgcc 60 

tgccagtcgc catgagaact tcctaccttc tgctgtttac tctctgctta cttttgtctg 120 

agatggcctc aggtggtaac tttctcacag gccttggcca cagatctgat cattacaatt 180 

gcgtcagcag tggagggcaa tgtctctatt ctgcctgccc gatctttacc aaaattcaag 240 

gcacctgtta cagagggaag gccaagtgct gcaagtgagc tgggagtgac cagaagaaat 300 

gacgcagaag tgaaatgaac tttttataag cattctttta ataaaggaaa attgcttttg 360 
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aagtat 366 

<210> 178 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005218 
<400> 178 

gggagtgacc agaagaaatg acgcagaagt gaaatgaact ttttataagc attcttttaa 60 

<210> 179 
<211> 1519 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_005326 
<400> 179 

ctgcctcgga acgctgtccc ccgcagcgac ggcccgttcc acctcgcgat ctgccgggta 60 

cccgggcggc gtggcgctcg gcctccaggg atccactgtg cggtgccaaa aaagaggcgg 120 

aggctcgcgg cacagctctc ccggcgcagc tctcgggccg ccgccgccgc tcccaggccc 180 

gtctcccggc ccgtggcagt cggggctcg'c ggacaaaaca agttgagcgc gagcgcgttg 240 

attggttggc ggacggtgcg aggtggacgc tgattggctg agggcagcgc gaggcgggcg 300 

ctgattggct gcgacgcgcc gacgccggtg ttttgcagtc ctgggcagct cggcagtcca 360 

gcccggcccg ggtcatggtg gtgggccgag ggctgctcgg ccgccgcagc ctcgccgcgc 420 

tgggagccgc ctgcgcccgc cgaggcctcg gtccagccct gctgggagtt ttctgccaca 480 

cagatttgcg gaagaacctg accgtggacg agggcaccat gaaggtagag gtgctgcctg 540 

ccctgaccga caactacatg tacctggtca ttgatgatga gaccaaggag gctgccattg 600 

tggatccggt gcagccccag aaggtcgtgg acgcggcgag aaagcacggg gtgaaactga 660 

ccacagtgct caccacccac caccactggg accatgctgg' cgggaatgag aaactggtca 720 

agctggagtc gggactgaag gtgtacgggg gtgacgaccg tatcggggcc ctgactcaca 780 

agatcactca cctgtccaca ctgcaggtgg ggtctctgaa cgtcaagtgc ctggcgaccc 840 

cgtgccacac ttcaggacac atttgttact tcgtgagcaa gcccggaggc tcggagcccc 900 

ctgccgtgtt cacaggtgac accttgtttg tggctggctg cgggaagttc tatgaaggga 960 

ctgcggatga gatgtgtaaa gctctgctgg aggtcttggg ccggctcccc ccggacacaa 1020 

gagtctactg tggccacgag tacaccatca acaacctcaa gtttgcacgc cacgtggagc 1080 

ccggcaatgc cgccatccgg gagaagctgg cctgggccaa ggagaagtac agcatcgggg 1140 

agcccacagt gccatccacc ctggcagagg agtttaccta caaccccttc atgagagtga 1200 

gggagaagac ggtgcagcag cacgcaggtg agacggaccc ggtgaccacc atgcgggccg 1260 

tgcgcaggga gaaggaccag ttcaagatgc cccgggactg , aggccgccct gcaccttcag 1320 

cggatttggg gattaggctc ttttaggtaa ctggctttcc tgctggtccg tgcgggaaat 1380 

tcagtcttga tttaacctta attttacagc ccttggcttg tgttatcgga cattctaatg 1440 

catatttata agagaagttt aacaagtatt tattcccata aaaaaaaaaa aaaaaaaaaa 1500 
aaaaaaaaaa aaaaaaaaa 1519 

<210> 180 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005326 
<400> 180 

cttgtgttat cggacattct aatgcatatt tataagagaa gtttaacaag tatttattcc 60 
<210> 181 



81 



WO 2006/084272 



PCT/LS2006/004280 



<211> 3378 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_005461 
<400> 181 

acagctgcac cgccgagctg cgagcggctg cgagcgagag agcgtaagag caagagagct 60 

agagagcgag caacgggcac tcgccccacg cctcccctca gccccaccgc gcgctccgct 120 

tgcctctcca ccccgcccga ctctacccgg cccggtccct gcgcgggcac agcccagagc 180 

tctggggcgg tgcaggcagc ctcgggactc tccggcgcgc cgccgcgtcc ccagacaaag 24 0 

gcttggccgg cggccccggc ccgctgcgcc ctcgctcccc gcctccccag ctcttctccg 300 

ctcttccccc ccgcgcttgg ctcggcgcgc tccggccggc cgcaaagttt cccgggcggc 360 

■agcggcggct gcgcctcgct tcagcgatgg ccgcggagct gagcatgggg ccagagctgc 420 

ccaccagccc gctggccatg gagtatgtca acgacttcga cctgctcaag ttcgacgtga 480 

agaaggagcc actggggcgc gcggagcgtc cgggcaggcc ctgcacacgc ctgcagccag 540 

ccggctcggt gtcctccaca ccgctcagca ctccgtgtag ctccgtgccc tcgtcgccca 600 

gcttcagccc gaccgaacag aagacacacc tcgaggatct gtactggatg gcgagcaact 660 

accagcagat gaaccccgag gcgctcaacc tgacgcccga ggacgcggtg gaagcgctca 720 

tcggctcgca cccagtgcca cagccgctgc aaagcttcga cagctttcgc ggcgctcacc 780 

accaccacca tcaccaccac cctcacccgc accacgcgta cccgggcgcc ggcgtggccc 840 

acgacgagct gggcccgcac gctcacccgc accatcacca tcatcaccaa gcgtcgccgc 900 

cgccgtccag cgccgctagc ccggcgcaac agctgcccac tagccacccc gggcccgggc 960 

cgcacgcgac ggcctcggcg acggcggcgg gcggcaacgg cagcgtggag gaccgcttct 1020 

ccgacgacca gctcgtgtcc atgtccgtgc gcgagctgaa ccgccacctg cggggcttca 1080 

ccaaggacga ggtgatccgc ctgaagcaga agcggcggac cctgaagaac cggggctacg 1140 

cccagtcttg caggtataaa cgcgtccagc agaagcacca cctggagaat gagaagacgc 1200 

agctcattca gcaggtggag cagcttaagc aggaggtgtc ccggctggcc cgcgagagag 1260 

acgcctacaa ggtcaagtgc gagaaactcg ccaactccgg cttcagggag gcgggctcca 1320 

ccagcgacag cccctcctct cccgagttct ttctgtgagt cgtggccggt cctggccccc 1380 

gcccttgccc cggcccggac tccctgtccc acgtccctag tcccagacta ccccggaccc 1440 

tgtccctgcc gcggccccag ccttgacctg tttgacttga gcgagaggga ggaagggcgc 1500 

gcgggccgcg ggcgacgggc gggtgcgcgg gcgggcaggg gaccttggct aaggcgagag 1560 

tagcgcacgc cagcgccgcc tcctagactc gagcagagcc ggagagagag acgagagggt 1620 

gggaggtccc ggagtaactt ctctccaggc tgaagggcgg cgaggcatag tcccgagaag 1680 

tcaccaaggc catctggaga ctcctggctt tctgaacttt gcgcgttaag ccgggacagc 1740 

tgctttgctg cccggagagt agtccgcgcc aggaagagag caacgaggaa aggagaggga 1800 

ctctggcgtc ccggcaggcg agaggcgagg ctgagcgaaa gaaggaagga cagacggacc 18 60 

tgtctgtcag agttcggaga acactggctc tcagccctga gacacaggcc tcagttagga 1920 

cgctcggcgc ccaaatctca tcagttttat tgcctgctcg attatataga aaaatacaaa 1980 

aaatctgcat taaaaatatt aatcctgcat gctggacatg tatggtaata atttctattt 2040 

tgtaccattt tcttgtttaa ctttagcatg ttgttgatca tggatcatac tccccttgtt 2100 

tctttgggtg agaagggatc gcagtttgga aactccggcg gctgcgtgcg gggtttcagt 2160 

cccagctgta ggcttgtaaa tacccgcccc gccaaaccgc atagagaacg tggcagcaag 2220 

ctgagggtct ttgtttgggt ttattattac ggtatttttg tttgtaagtt aaaaagaaaa 2280 

aaaaaaagaa aaagttccgg gcattttgca tcagaaaaca actttgtctt ggggcacact 234 0 

tggaagttgc atgttttctt tccttccctt atccccattc ggtcctcttt ttcctctctc 2400 

gctttagttt tcaaccttgt tggtgctgag agagagaacc gagaggtccc agtacaaggg 24 60 

cagggcaggg cagggaagct gccaagctcc gcaccccaga ggagtgttct ggactacagc 2520 

cttgtcttat ggtcaaattg atacccttaa taagaaagga aaggaaagga aaacagatcc 2580 

tcccctctgc tttttattgt aaccagaatc accctgaggt cccttctgaa ccctctgggc 2640 

ctgcgctaat tgtaggagcc acagcgctcc tagggtgaga ggcttagcca tccctgaccc 2700 

tggcagtgca ctggtaagca gacactgcac tgaaccaact gctatgctca gaatgtacca 27 60 

gaaacccaaa cattggcaag taattttgca actttcaagt gcgttcttta gaccaatgca 2820 

ttgcgtttct ttccctgctt ttgagatagt aggaagagtt cttggtggtg tcccccccct 2880 

tcaattcttc agttgtatag tagttatagg gaagatatgg gtgtttttct ttattattac 2940 

tttttttttt ctgcaggtca gtaaaaggat ttaagttgca ctgacaaaaa taccaaaata 3000 

aaagtgtatt tttaagttcc catttgaaat tgctggcgct gctggccgga tgcatttttg 3060 

agtttgtatt agttgataaa ttaacagtaa taacaagatt gtatgaaccg catggtgctt 3120 

gcagttttaa atattgtgga tatttgtcct gcatcagaaa cgagctttgg tttttacaga 3180 
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ttcaactgtg ttgaaatcaa acctgccgca acagaaattg tttttatttc atgtaaaata 3240 

agggatcaat ttcaaaccct gcttatgata tgaaaatatt aaaacctagt ctattgtagt 3300 

tttattcaga ctggtttctg ttttttggtt attaaaatgg tttcctattt tgcttattaa 3360 

aaaaaaaaaa aaaaaaaa 3378 

<210> 182 
<2U> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_005461 
<400> 182 

atttgtcctg catcagaaac gagctttggt ttttacagat tcaactgtgt tgaaatcaaa 60 

<210> 183 
<211> 597 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_005532 
<400> 183 

agctgaagtt gaggatctct tactctctaa gccacggaat 
gcctctgctc tcacctcatc agcagtgacc agtgtggcca 
ggctctgccg tagttttgcc cctggccagg attgctacag 
gccatggcgg ctgtgcccat ggtgctcagt gccatgggct 
tcgtcctcca tagcagccaa gatgatgtcc gcggcggcca 
gcctcgggca gccttgtggg tactctgcag tcactgggag 
accaagttca tcctgggctc cattgggtct gccattgcgg 
tagctccctg cccctcgccc tgcagagaag agaaccatgc 
catcctgacc cagcgaggag ccaactatcc caaatatacc 
ctgcatctcc agaggaaaat aagaaataaa gatgaattgt 

<210> 184 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_005532 
<400> 184 

agccaactat cccaaatata cctgggtgaa atataccaaa ttctgcatct ccagaggaaa 60 

<210> 185 
<211> 1661 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_005566 
<400> 185 

tgctgcagcc gctgccgccg 
acgtgcattc ccgattcctt 
gatttataat cttctaaagg 
tggtgctgtt ggcatggcct 
tgctcttgtt gatgtcatcg 



taacccgagc aggcatggag 60 

aagtggtcag ggtggcctct 120 

ttgtgattgg aggagttgtg 180 

tcactgcggc gggaatcgcc 240 

ttgccaatgg gggtggagtt 300 

caactggact ctccggattg 360 

ctgtcattgc gaggttctac 420 

caggggagaa ggcacccagc 480 

tgggtgaaat ataccaaatt 540 
tgcaactctt aaaaaaa 597 



attccggatc tcattgccac gcgcccccga cgaccgcccg 60 

ttggttccaa gtccaatatg gcaactctaa aggatcagct 120 

aagaacagac cccccagaat aagattacag ttgttggggt 180 

gtgccatcag tatcttaatg aaggacttgg cagatgaact 240 

aagacaaatt gaagggagag atgatggatc tccaacatgg 300 
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cagccttttc 


cttagaacac 


caaagattgt 


ctctggcaaa 


gactataatg 


taactgcaaa 


360 


ctccaagctg 


gtcattatca 


cggctggggc 


acgtcagcaa 


gagggagaaa 


gccgtcttaa 


420 


tttggtccag 


cgtaacgtga 


acatatttaa 


attcatcatt 


cctaatgttg 






cccgaactgc 


aagttgctta 


ttgtttcaaa 


tccagtggat 


atcttgacct 


acgtggcttg 


540 


gaagataagt 


ggttttccca 


aaaaccgtgt 


tattggaagt 


ggttgcaatc 


t gga 1 1 cage 


600 


ccgattccgt 


tacctgatgg 


gggaaaggct 


gggagttcac 


ccattaagct 


gtcatgggtg 




ggtccttggg 


gaacatggag 


attccagtgt 


gcctgtatgg 






7?n 


tgtctctctg 


aagactctgc 


acccagattt 


agggactgat 


aaagataagg 


ac t 


780 


agaggttcac 


aagcaggtgg 


ttgagagtgc 


ttatgaggtg 


atcaaactca 


t 


840 


atcctgggct 


attggactct 


ctgtagcaga 






agaatcttag 


900 


gcgggtgcac 


ccagtttcca 


ccatgattaa 


gggtctttac 


ggaataaagg 


atgatgtctt 


960 


ccttagtgtt 


ccttgcattt 


tgggacagaa 


tggaatctca 


gaccttgtga 






gacttctgag 


gaagaggccc 


gtttgaagaa 


gagtgcagat 


acactttggg 


ggatccaaaa 


1080 


ggagctgcaa 


ttttaaagtc 


ttctgatgtc 


atatcatttc 


actgtctagg 




1140 


attctaggtg 


gaggttgtgc 


atgttgtcct 


ttttatctga 


tctgtgatta 


aagcagtaat 


1200 


attttaagat 


ggactgggaa 


aaacatcaac 


tcctgaagtt 


agaaataaga 


atggtttgta 


1260 




aattct° Ct9 


atgctggatg 


gtattaatct 


tgtgtagtct 


tcaactggtt 


1320 


agtgtgaaat 


9 






atgctgtacg 


tactgeattt 






ccaggtggat 


gtttaccgtg 


4- i. J- a i- ^ i- = a 

cgi: Latauaa 


cttcctggct 


ccttcactga 


1440 


acatgcctag 


tccaacattt 


tttcccagtg 


agtcacatcc 


tgggatccag 


tgtataaatc 


1500 


caatatcatg 


tcttgtgcat 


aattcttcca 


aaggatctta 


ttttgtgaac 


tatatcagta 


1560 






taaaaagatc 


tacatacaaa 


caatgcaacc 


aactatccaa 


1620 


gtgttatacc 


aactaaaacc 


cccaataaac 


cttgaacagt 


g 1661 






<210> 186 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_005566 












<400> 186 














catcaactcc tgaagttaga 


aataagaatg 


gtttgtaaaa 


tccacagcta 


tatcctgatg 


60 


<210> 187 














<211> 2993 














<212> DNA 














<213> Homo sapiens 












<300> 














<308> NM_005689 












<400> 187 














gggcctgcag 


ttggcagaag 






c gggg cc gtg 


ctgagaegge 




gtacgtgcoc 


tgcgtgagtg 




gcgcgtgcgc 


taggggagtg 


ggcggtgagg 




cctggtccac 


gtgcgtccct 


tcccgggacc 


cccgcagctt 


ggcgcccagc 


ggctacgtga 




gccaaggcac 


ccggatgtcc 


gcgcccctct 






cegg cccg 




cagtgcccgc 


agcctcggcc 


ggcgtccacg 


cattgccatg 








cgaggccgaa 


gggcccgtgg 


gtccggcctg 










cttcacgctc 


gtgccctcga 


cgcggatggc 


tctagggact 


ctggccttgg 


g gg c 




tccctgcaga 


cgccgggagc 


ggcccgctgg 


tgctgattcg 




gggccggccc 


480 


tcgcatctct 


ccctacgtgc 


tgcagctgct 


tctggccaca 


cttcaggcgg 


cgctgcccct 


540 


ggccggcctg 


gctggccggg 


tgggcactgc 


ccggggggcc 


ccactgocaa 


gctatctact 


600 


tctggcctcc 


gtgctggaga 


gtctggccgg 


cgcctgtggc 


ctgtggctgc 


ttgtcgtgga 


660 


gcggagccag 


gcacggcagc 


gtctggcaat 


gggcatctgg 


atcaagttca 


ggcacagccc 


720 


tggtctcctg 


ctcctctgga 


ctgtggcgtt 


tgcagctgag 


aacttggccc 


tggtgtcttg 


780 


gaacagccca 


cagtggtggt 


gggcaagggc 


agacttgggc 


caacaggttc 


agtttagcct 


840 


gtgggtgctg 


cggtatgtgg 


tctctggagg 


gctgtttgtc 


ctgggtctct 


gggcccctgg 


900 


acttcgtccc 


cagtcctata 


cattgcaggt 


tcatgaagag 


gaccaagatg 


tggaaaggag 


960 


ccaggttcgg 


tcagcagccc 


aacagtctac 


ctggcgagat 


tttggcagga 


agctccgcct 


1020 




WO 2006/084272 PCT/US2006/004280 



cctgagtggc tacctgtggc ctcgagggag tccagctctg cagctggtgg tgctcatctg 1080 

cctggggctc atgggtttgg aacgggcact caatgtgttg gtgcctatat tctataggaa 1140 

cattgtgaac ttgctgactg agaaggcacc ttggaactct ctggcctgga ctgttaccag 1200 

ttacgtcttc ctcaagttcc tccagggggg tggcactggc agtacaggct tcgtgagcaa 1260 

cctgcgcacc ttcctgtgga tccgggtgca gcagttcacg tctcggcggg tggagctgct 1320 

catcttctcc cacctgcacg agctctcact gcgctggcac ctggggcgcc gcacagggga 1380 

ggtgctgcgg atcgcggatc ggggcacatc cagtgtcaca gggctgctca gctacctggt 1440 

gttcaatgtc atccccacgc tggccgacat catcattggc atcatctact tcagcatgtt 1500 

cttcaacgcc tggtttggcc tcattgtgtt cctgtgcatg agtctttacc tcaccctgac 1560 

cattgtggtc actgagtgga gaaccaagtt tcgtcgtgct atgaacacac aggagaacgc 1620 

tacccgggca cgagcagtgg actctctgct aaacttcgag acggtgaagt attacaacgc 1680 

cgagagttac gaagtggaac gctatcgaga ggccatcatc aaatatcagg gtttggagtg 1740 

gaagtcgagc gcttcactgg ttttactaaa tcagacccag aacctggtga ttgggctcgg 1800 

gctcctcgcc ggctccctgc tttgcgcata ctttgtcact gagcagaagc tacaggttgg 1860 

ggactatgtg ctctttggca cctacattat ccagctgtac atgcccctca attggtttgg 1920 

cacctactac aggatgatcc agaccaactt cattgacatg gagaacatgt ttgacttgct 1980 

gaaagaggag acagaagtga aggaccttcc tggagcaggg ccccttcgct ttcagaaggg 2040 

ccgtattgag tttgagaacg tgcacttcag ctatgccgat gggcgggaga ctctgcagga 2100 

cgtgtctttc actgtgatgc ctggacagac acttgccctg gtgggcccat ctggggcagg 2160 

gaagagcaca attttgcgcc tgctgtttcg cttctacgac atcagctctg gctgcatccg 2220 

aatagatggg caggacattt cacaggtgac ccaggcctct ctccggtctc acattggagt 2280 

tgtgccccaa gacactgtcc tctttaatga caccatcgcc gacaatatcc gttacggccg 234 0 

tgtcacagct gggaatgatg aggtggaggc tgctgctcag gctgcaggca tccatgatgc 2400 

cattatggct ttccctgaag ggtacaggac acaggtgggc gagcggggac tgaagctgag 24 60 

cggcggggag aagcagcgcg tcgccattgc ccgcaccatc ctcaaggctc cgggcatcat 2520 

tctgctggat gaggcaacgt cagcgctgga tacatctaat gagagggcca tccaggcttc 2580 

tctggccaaa gtctgtgcca accgcaccac catcgtagtg gcacacaggc tctcaactgt 2640 

ggtcaatgct gaccagatcc tcgtcatcaa ggatggctgc atcgtggaga ggggacgaca 2700 

cgaggctctg ttgtcccgag gtggggtgta tgctgacatg tggcagctgc agcagggaca 2760 

ggaagaaacc tctgaagaca ctaagcctca gaccatggaa cggtgacaaa agtttggcca 2820 

cttccctctc aaagactaac ccagaaggga ataagatgtg tctcctttcc ctggcttatt 2880 

tcatcctggt cttggggtat ggtgctagct atggtaaggg aaagggacct ttccgaaaaa 2940 
catcttttgg ggaaataaaa atgtggactg tgaaaaaaaa aaaaaaaaaa aaa 2993 

<210> 188 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005689 
<400> 188 

ggaaagggac ctttccgaaa aacatctttt ggggaaataa aaatgtggac tgtgaaaaaa 60 

<210> 189 
<211> 1830 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_005749 
<400> 189 

ggggagttga aacctaattt tgtggcgtag cagctatgca gcttgaaatc caagtagcac 60 
taaattttat tatttcgtat ttgtacaata agcttcccag gagacgtgtc aacatttttg 120 
gtgaagaact tgaaagactt cttaagaaga aatatgaagg gcactggtat cctgaaaagc 180 
catacaaagg atcggggttt agatgtatac acatagggga gaaagtggac ccagtgattg 240 
aacaagcatc caaagagagt ggtttggaca ttgatgatgt tcgtggcaat ctgccacagg 300 
atcttagtgt ttggatcgac ccatttgagg tttcttacca aattggtgaa aagggaccag 360 
tgaaggtgct ttacgtggat gataataatg aaaatggatg tgagttggat aaggagatca 420 
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aaaacagctt 


taacccagag 


gcccaggttt 


ttatgcccat 


aagtgaccca 


gcctcatcag 


480 


tgtccagctc 


tccatcgcct 


ccttttggtc 


actctgctgc 


tgtaagccct 


accttcatgc 


540 


cccggtccac 


tcagccttta 


acctttacca 


ctgccacttt 








ctaccaaaat 


gaagaatagt 


ggccgtagca 


acaaggttgc 


acgtacttct 


cccatcaacc 


660 


tcggcttgaa 


tgtgaatgac 




agaaagccat 


ctcttcctca 






tgtatgggct 




agccagcagc 






ccagcccagc 


780 


cgccaccgcc 


accaccacca 


ccacagcagc 


aacaacaaca 




gctctttctc 


840 


ctaatgccaa 


ggaatttatt 


tttcctaata 


ta t 


t 


aCCaa j^^ aa 






ctata 50 ^ 0 ^ 


cttaacctca 


gtcctctcca 


? 4. t 

g acag aa 






tgtttgcagc 






agtcttttgt 


aga ggc g 


a g 


Ji 






f 

aaccagcaa 


fcccagcctgt 


tatggctaac 


taaaaaaaag 




aaaatgtatc 


gtacaagtta 




gcccaagggg 


gatttttttt 


ttcacctcct 


1 1 AC\ 


tgagaatttt 


tttttttaag 




ggatacattc 


aagcttggtt 


aaaaaaataa 


1200 




tgcatcattt 


ca gcc 


aaccaagcac 


aaagttattt 


tatactgact 




gtatatttta 


aagtatactc 


tcagatatgg 


cctcttacag 


tatttaagat 


atagcaagga 


1320 


catggctgat 


ttttttttat 


aaaaattggc 


actaataagt 


gggtttattg 


gtcttttcta 


1380 


attgtataat 


ttaatttagt 


acaaagtttg 


taaaatatca 


gaggatatat 


atatattgtt 


1440 


tctacgacat 


ggtattgcat 


ttatatcttt 


ttactacagt 


gatctgtgac 


agcagcagct 


1500 


tcatgttgta 


ttttttttac 


tgaaattgta 


aaatatccat 


cttaaagaca 


tcaactattc 


1560 


taaaaattgt 


gtacaggata 


ttcctttagt 


ggtggaatta 


aaatgtacga 


atacttgctt 


1620 


tttcaaaaaa 


atgtattttc 


tgttaaaagt 


ttaaagattt 


ttgctatata 


ttatggaaga 


1680 


aaaatgtaat 


cgtaaatatt 


aattttgtac 


ctatattgtg 


caatacttga 


aaaaaacggt 


1740 


ataaaagtat 


tttgagtcag 


tgtcttacat 


gttaagaggg 


actgaaatag 


tttatattaa 


1800 


gtttgtatta 


aaattcttta 


aaattaaaaa 


1830 








<210> 190 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_005749 












<400> 190 














aaacctctgc 


tctttctcct 


aatgccaagg 


aatttatttt 


tcctaatatg 


cagggtcaag 


60 


<210> 191 














<211> 1534 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_005804 













<400> 191 

ggaagcgcag caactcgtgt ctgagcgccc 
agcagcgcgc ggagaagaac ggggagccag 
tcttttggat tacgatgaag aggaagagcc 
ccctaagaaa gacatcaagg gatcctacgt 
tctgctgaag ccggagctcc tgcgggccat 
ggtccagcat gagtgcattc cccaggccat 
gtccgggatg ggcaagacag cggtcttcgt 
caacggacag gtgaoggtcc tggtcatgtg 
caaggaatat gagcgctttt ccaagtacat 
tggtctctcc atcaagaagg atgaagaagt 
ggggaccccg ggccgcatcc tggcgctcgt 
gaagcacttt gtgctggacg agtgtgacaa 
tgtgcaggag atcttccgcc tgacaccaca 
cctgagcaag gacatccggc ctgtgtgcag 
tgtggacgac gagaccaagc tcacgctgca 



ggcggaaaac cgaagttgga agtgtctctt 60 

catcatggca gaacaggatg tggaaaacga 120 

ccaggctcct caagagagca caccagctcc 180 

ttccatccac agctctggct tccgggactt 240 

cgtggactgt ggctttgagc atccttctga 300 

cctgggcatg gacgtcctgt gccaggccaa 3 60 

gctggccacc ctacagcaga ttgagcctgt 4 20 

ccacacgagg gagctggcct tccagatcag 4 80 

gcccagcgtc aaggtgtctg tgttcttcgg 540 

gttgaagaag aactgtcccc atgtcgtggt 600 

gcggaatagg agcttcagcc taaagaatgt 660 

gatgctggag cagctggaca tgcggcggga 7 20 

cgagaagcag tgcatgatgt tcagcgccac 7 80 

gaagttcatg caggatccca tggaggtgtt 84 0 

cggcctgcag cagtactacg tcaaactcaa 900 
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agacagtgag 


aagaaccgea 


agctctttga 


tctcttggat 


gtgctggagt 


ttaaccaggt 


960 


gataatcttc 


gtcaagtcag 


tgcagcgctg 


catggccctg 


gcccagctcc 


tcgtggagca 


1020 


gaacttcccg 


gccatcgcca 


tccaccgggg catggcccag 


gaggagegee 


tgtcaegcta 


1080 


tcagcagttc 


aaggatttcc 


ageggeggat 


cctggtggcc 


accaatctgt 


ttggccgggg 


1140 


gatggacatc 


gagegagtea 


acategtett 


taactacgac 


atgectgagg 


actcggacac 


1200 






gggegggteg ctttggcacc 


aaaggectag 


ccatcacttt 




tgtgtctgac 


gagaatgatg 


ccaaaatcct 


caatgaegtc 


caggaceggt 


ttgaagttaa 


1320 


tgtggcagaa 


cttccagagg 


aaatcgacat 


ctccacatac 


atcgagcaga 


gccggtaacc 


1380 


accacgtgcc 


agagccgccc 


acccggagcc 


gcccgcatgc 


agcttcacct 


cccctttcca 


1440 




ttgagaagct 


agagattgta 


tgagaataaa 


cttgttatta 


tggaaaaaaa 


1500 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaa 1534 








<210> 192 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_005804 












<400> 192 














gttgagaagc tagagattgt 


atgagaataa 


agtgttatta 


tgaaatgaag 


aagcctcacc 


60 


<210> 193 














<211> 1416 














<212> DNA 














<213> Homo 


sapiens 












<220> 














<221> Modified base 












<222> 1 . . . 


, 1416 












<223> n = a, c,g, or t 












<300> 














<308> NM_005945 












<400> 193 














aggaattccg 


gaattccgga 


attccgatgg 


atggaacaga 


aaataaatct 


aagtttggtg 


60 


cgaacgccat 


tctgggggtg 


tcccttgccg 


tetgeaaage 


tggtgccgtt 


gagaaggggg 


120 


tcccctgtac 


cgccacatcg 


cgtacttggc 


tggcaacttc 


gaagtcatcc 


tgccagtccc 


180 


ggcgttcaag 


tgtcatcatc 


aatggcggtt 


etcatgetgg 


caacaagctg 


gecatgeaga 




gtctgtcctc 


ecagteggtg 


cagcaaactc agggaageca 


gecgea g 


gagcagaggt 


Tnn 


ttaccacaac 


ctgaagaatg 


tcatcaagga 


gaaatatggg 




ccaatgtggg 


Ten 


gatttgegeg 


ggtttgctcc 


caacatcctg 


gagaataaag 


aaggcctgga 


getgetgaag 


420 


actgetattg 


gaaagcctgg 


cctacactgt 


aaaggtggtc 


atggcatgga 


cgtagcggcc 




tccgagttct 


tcaggtcagg 


gaactatgac 


ctggacttca 


ag cccga 


tgaccccagc 


s/m 


aggtacatct 


cgcctgacca 


gctggctgac 


ctgtacaagt 


cc ca caa 


ggactaccca 


600 


gtggtgtcta 


tcgaagatcc 


ctttgaccag 


gatgactggg 


gagcttcaga 


agttcacagc 


660 


cagtgcagga 


atccaggtag 


tggggggatg 


actcacagtg 


accaacccaa 


agaggatege 


720 


caaggcgtga 


acgagaagtc 


ctgcaactgc 


ctcctgctca 


aagtcaacca 


gattggctcc 


780 


gtgaccgagt 


ctcttcaggc 


gtgcaagctg 


gcccaggcca 


atggttgggg 


cgtcatggtg 


840 


tetcategtt 


egggggagae 


tgaagatacc 


ttcatcgctg 




ggggctgtgc 




actggggcag 


atcaagactg 


gtgccccttg 


ccgatcacgc 


gcttggccaa 




960 


ctcctcagaa 


ttgaagagga 


gctgggcagc 


aaggctaagt 


ttgccggcag 


gaacttcaga 


1020 


aacoccttgg 


ccaagtaagc 


tgtgggcagg 


caagccttcg 


gtcacctgtt 


ggctacagac 


1080 


ccctcccctg 


gtgtcagctc 


aggcagctcg 


aggcccccga 


ccaacacttg 


caggggtccc 


1140 


tgctagttag 


cgcccaccgc 


cgtggagttc gtaccgcttc 


cttagaactc 


tacagaagee 


1200 


aagctccctg 


gaagccctgt 


tggcagctct 


agetttgeag 


ttgtgtaatt 


ggcccaagtc 


1260 


attgtttttc 


tcgccttact 


ttccaccaag tgtctagagt 


catgtgagcc 


tngtgtcatc 


1320 


tccggggtgg 


ccacaggcta 


gatccccggt ggttttgtgc 


tcaaaataaa 


aagcctcagt 


1380 


gacccatgaa 


aaaaaaaaag 


gaattccgga 


attccg 1416 
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<210> 194 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005945 
<400> 194 

ttgtgtaatt ggcccaagtc attgtttttc tcgccttact ttccaccaag tgtctagagt 60 

<210> 195 
<211> 961 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_006014 
<400> 195 

ggcgaccacg gtgtcttcaa aagccccgtc agggttggct tcctggggcc ggaccgactg 60 

tgggtcagtt tgcaccagcg ctctggaatc gagttacgcg cgaaagggca gagtttctgg 120 

aggaaaccgc agcctctcaa ccgctgaccg ggtctcagaa ggcccccggc agggccgctt 180 

ggcgggaact gaccacgcgc cagtcaggct ctccagggac ctgcgcaggc gcgtgtgggc 240 

ggagtcgtgc gcagggggcg gggcttcggg aaggagccac agagagggcg gggcgtagga 300 

cctgcgcttc gggggtggag tcggagcggc gcggcggcgg tcatgcggga cgcggatgca 360 

gacgcaggcg gaggcgctga cggcggggat ggccggggtg gccacagctg ccgcgggggc 420 

gtggacacag ccgcagctcc ggccggtgga gctcccccag cgcacgcgcc aggtccgggc 480 

agagacgccg cgtctgcggc cagggggtca cgaatgcggc cgcacatatt caccctcagc 540 

gtgcctttcc cgaccccctt ggaggcggaa atcgcccatg ggtccctggc accagatgcc 600 

gagccccacc aaagggtggt tgggaaggat ctcacagtga gtggcaggat cctggtcgtc 660 

cgctggaaag ctgaagactg tcgcctgctc cgaatttccg tcatcaactt tcttgaccag 720 

ctttccctgg tggtgcggac catgcagcgc tttgggcccc ccgtttcccg ctaagcctgg 780 

cctgggcaaa tggagcgagg tcccactttg cgtctccttg taggcagtgc gtccatcctt 840 

ccctagggca ggaattccca cagttgctac tttcctggga gggcctcatg ttttatctgg 900 

ttcttaaatg tttgttacta cagaaaataa aactgcgcta ctaaaaaaaa aaaaaaaaaa 960 
a 961 

<210> 196 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006014 
<400> 196 

ggcctcatgt tttatctggt tcttaaatgt ttgttactac agaaaataaa actgaggtat 60 

<210> 197 

<211> 1648 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006086 
<400> 197 

atgcgggaga tcgtgcacat ccaggccggc cagtgcggca accagatcgg ggccaagttc 60 
tgggaagtca tcagtgatga gcatggcatc gaccccagcg gcaactacgt gggcgactcg 120 
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gacttgcagc tggagcggat cagcgtctac tacaacgagg cotcttctca caagtacgtg 180 

cctcgagcca ttctggtgga cctggaaccc ggaaccatgg acagtgtccg ctcaggggcc 240 

tttggacatc tcttcaggcc tgacaatttc atctttggtc agagtggggc cggcaacaac 300 

tgggccaagg gtcactacac ggagggggcg gagctggtgg attcggtcct ggatgtggtg 360 

cggaaggagt gtgaaaactg cgactgcctg cagggcttcc agctgaccca ctcgctgggg 420 

ggggggacgg gctccggcat gggcacgttg ctcatcagca aggtgcgtga ggagtatccc 480 

gaccgcatca tgaacacctt cagcgtcgtg ccctcaccca aggtgtcaga cacggtggtg 540 

gaaccctaca acgccacgct gtccatccac cagctggtgg aaaacacgga tgaaacctac 600 

tgcatcgaca acgaggcgct ctacgacatc tgcttccgca ccctcaagct ggccacgccc 660 

acctacgggg acctcaacca cctggtatcg gccaccatga gcggagtcac cacctccttg 720 

cgcttcccgg gccagctcaa cgctgacctg cgcaagctgg ccgtcaacat ggtgcccttc 780 

ccgcgcctgc acttcttcat gcccggcttc gcccccctca ccaggcgggg cagccagcag 840 

taccgggccc tgaccgtgcc cgagctcacc cagcagatgt tcgatgccaa gaacatgatg 900 

gccgcctgcg acccgcgcca cggccgctac ctgacggtgg ccaccgtgtt ccggggccgc 960 

atgtccatga aggaggtgga cgagcagatg ctggccatcc agagcaagaa cagcagctac 1020 

ttcgtggagt ggatccccaa caacgtgaag gtggccgtgt gtgacatccc gccccgcggc 1080 

ctcaagatgt cctccacctt catcgggaac agcacggcca tccaggagct gttcaagcgc 1140 

atctccgagc agttcacggc catgttccgg cgcaaggcct tcctgcactg gtacacgggc 1200 

gagggcatgg acgagatgga gttcaccgag gccgagagca acatgaacga cctggtgtcc 1260 

gagtaccagc agtaccagga cgccacggcc gaggaagagg gcgagatgta cgaagacgac 1320 

gaggaggagt cggaggccca gggccccaag tgaaactgct cgcagctgga gtgagaggca 1380 

ggtggcggcc ggggccgaag ccagcagtgt ctaaaccccc ggagccatct tgctgccgac 1440 

accctgcttt ccccatcgcc ctagggctcc cttgccgccc tcctgcagta tttatggcct 1500 

cgtcctcccc cacctaggcc acgtgtgagc tgctcctgtc tctgtcttat tgcagctcca 1560 

ggcctgacgt tttacggttt tgttttttac tggtttgtgt ttatattttc ggggatactt 1620 
aataaatcta ttgctgtcag ataccctt 1648 

<210> 198 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006086 
<400> 198 

tttttactgg tttgtgttta tattttcggg gatacttaat aaatctattg ctgtcagata 60 

<210> 199 
<211> 3074 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_006096 
<400> 199 

aacaaacctc gcctggctcc cagctggtgc tgaagctcgt cagttcacca tccgccctcg 60 

gcttccgcgg ggcgctgggc cgccagcctc ggcaccgtcc tttcctttct ccctcgcgtt 120 

aggcaggtga cagcagggac atgtctcggg agatgcagga tgtagacctc gctgaggtga 180 

agcctttggt ggagaaaggg gagaccatca ccggcctcct gcaagagttt gatgtccagg 240 

agcaggacat cgagacttta catggctctg ttcacgtcac gctgtgtggg actcccaagg 300 

gaaaccggcc tgtcatcctc acctaccatg acatcggcat gaaccacaaa acctgctaca 360 

accccctctt caactacgag gacatgcagg agatcaccca gcactttgcc gtctgccacg 420 

tggacgcccc tggccagcag gacggcgcag cctccttccc cgcagggtac atgtacccct 4 80 

ccatggatca gctggctgaa atgcttcctg gagtccttca acagtttggg ctgaaaagca 540 

ttattggcat gggaacagga gcaggcgcct acatcctaac tcgatttgct ctaaacaacc 600 

ctgagatggt ggagggcctt gtccttatca acgtgaaccc ttgtgcggaa ggctggatgg 660 

actgggccgc ctccaagatc tcaggatgga cccaagctct gccggacatg gtggtgtccc 720 

acctttttgg gaaggaagaa atgcagagta acgtggaagt ggtccacacc taccgccagc 780 

acattgtgaa tgacatgaac cccggcaacc tgcacctgtt catcaatgcc tacaacagcc 840 
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ggcgcgacct ggagattgag cgaccaatgc cgggaaccca cacagtcacc ctgcagtgcc 900 

ctgctctgtt ggtggttggg gacagctcgc ctgcagtgga tgccgtggtg gagtgcaact 960 

caaaattgga cccaacaaag accactctcc tcaagatggc ggactgtggc ggcctcccgc 1020 

agatctccca gccggccaag ctcgctgagg ccttcaagta cttcgtgcag ggcatgggat 1080 

acatgccctc ggctagcatg acccgcctga tgcggtcccg cacagcctct ggttccagcg 114 0 

tcacttctct ggatggcacc cgcagccgct cccacaccag cgagggcacc cgaagccgct 1200 

cccacaccag cgagggcacc cgcagccgct cgcacaccag cgagggggcc cacctggaca 1260 

tcacccccaa ctcgggtgct gctgggaaca gcgccgggcc caagtccatg gaggtctcct 1320 

gctaggcggc ctgcccagct gccgcccccg gactctgatc tctgtagtgg ccccctcctc 1380 

cccggcccct tttcgccccc tgcctgccat actgcgccta actcggtatt aatccaaagc 1440 

ttattttgta agagtgagct ctggtggaga caaatgaggt ctattacgtg ggtgccctct 1500 

ccaaaggcgg ggtggcggtg gaccaaagga aggaagcaag catctccgca tcgcatcctc 1560 

ttccattaac cagtggccgg ttgccactct cctcccctcc ctcagagaca ccaaactgcc 1620 

aaaaacaaga cgcgtagcag cacacacttc acaaagccaa gcctaggccg ccctgagcat 1680 

cctggttcaa acgggtgcct ggtcagaagg ccagccgccc acttcccgtt tcctctttaa 1740 

ctgaggagaa gctgatccag tttccggaaa caaaatcctt ttctcatttg gggagggggg 1800 

taatagtgac atgcaggcac ctcttttaaa caggcaaaac aggaaggggg aaaaggtggg 18 60 

attcatgtcg aggctagagg catttggaac aacaaatcta cgtagttaac ttgaagaaac 1920 

cgatttttaa agttggtgca tctagaaagc tttgaatgca gaagcaaaca agcttgattt 1980 

ttctagcatc ctcttaatgt gcagcaaaag caggcgacaa aatctcctgg ctttacagac 2040 

aaaaatattt cagcaaacgt tgggcatcat ggtttttgaa ggctttagtt ctgctttctg 2100 

cctctcctcc acagccccaa cctcccaccc ctgatacatg agccagtgat tattcttgtt 2160 

cagggagaag atcatttaga tttgttttgc attccttaga atggagggca acattccaca 2220 

gctgccctgg ctgtgatgag tgtccttgca ggggccggag taggagcact ggggtggggg 2280 

tggaattggg gttactcgat gtaagggatt ccttgttgtt gtgttgagat ccagtgcagt 2340 

tgtgatttct gtggatccca gcttggttcc aggaattttg tgtgattggc ttaaatccag 2400 

ttttcaatct tcgacagctg ggctggaacg tgaactcagt agctgaacct gtctgacccg 24 60 

gtcacgttct tggatcctca gaactctttg ctcttgtcgg ggtgggggtg ggaactcacg 2520 

tggggagcgg tggctgagaa aatgtaagga ttctggaata catattccat gggactttcc 2580 

ttccctctcc tgcttcctct tttcctgctc cctaaccttt cgccgaatgg ggcagcacca 2640 

ctgacgtttc tgggcggcca gtgcggctgc caggttcctg tactactgcc ttgtactttt 2700 

cattttggct caccgtggat tttctcatag gaagtttggt cagagtgaat tgaatattgt 2760 

aagtcagcca ctgggacccg aggatttctg ggaccccgca gttgggagga ggaagtagtc 2820 

cagccttcca ggtggcgtga gaggcaatga ctcgttacct gccgcccatc accttggagg 2880 

ccttccctgg ccttgagtag aaaagtcggg gatcggggca agagaggctg agtacggatg 2940 

ggaaactatt gtgcacaagt ctttccagag gagtttctta atgagatatt tgtatttatt 3000 

tccagaccaa taaatttgta actttgcagc ggaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3060 
aaaaaaaaaa aaaa 3074 

<210> 200 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006096 
<400> 200 

gagtacggat gggaaactat tgtgcacaag tctttccaga ggagtttctt aatgagatat 60 

<210> 201 
<211> 2148 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_006115 
<400> 201 

gcttcagggt acagctcccc cgcagccaga agccgggcct gcagcgcctc agcaccgctc 60 

cgggacaccc cacccgcttc ccaggcgtga cctgtcaaca gcaacttcgc ggtgtggtga 120 
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actctctgag 
gagacctaga 
acgaaggcgt 
cccacggaga 
tgccgccctg 
cgggagacac 
tctgggagtg 
tggacttgat 
ggatttacgg 
tctgtactca 
tggtttgagc 
cctcaaggaa 
gaaaaatgta 
tatcaagatg 
tacctggaag 
gcgtagactc 
gcagtatatc 
tgtggactct 
ccccttggaa 
gtcccagagt 
cgatgtaagt 
cctggtcttt 
gagccactgc 
cttgcagagt 
tgtccccctg 
tctgcatgcc 
tagtgccaac 
gtgcccctgt 
ttggacacta 
acaaatgttc 
gttcagtgag 
gtgatctttg 
gattctggct 
tgttgaaaat 



gaaaaaccat 
aatccaagcg 
ttgtggggtt 
cttgtggagc 
gagttgctgc 
agccagaccc 
ctgatgaagg 
gtgctccttg 
aagaactctc 
tttccagagc 
acagaggcag 
ggtgcctgtg 
ctacgcctgt 
atcctgaaaa 
ctacccacct 
ctcctctccc 
gcccagttca 
ttatttttcc 
accctctcaa 
cccagcgtca 
cccgagcccc 
gatgagtgtg 
tcccagctta 
ctcctgcagc 
gagagttatg 
aggctcaggg 
ccctgtcctc 
ttcatgccta 
aagccaggat 
agtgtgagtg 
gaaaaaaagg 
gggagataca 
tgggaagtac 
aaagagaagc 



tttgattatt 
ttggaggtcc 
ccattcagag 
tggcagggca 
ccagggagct 
tgaaggcaat 
gacaacatct 
cccaggaggt 
atcaggactt 
cagaagcagc 
agcagccctt 
atgaattgtt 
gctgtaagaa 
tggtgcagct 
tggcgaaatt 
acatccatgc 
cctctcagtt 
ttagaggccg 
taactaactg 
gtcagctaag 
tccaagctct 
ggatcacgga 
caaccttaag 
acctcatcgg 
aggacatcca 
agttgctgtg 
actgtgggga 
actagctggg 
gtgcatgcat 
aggaaaacat 
ggaagttggg 
tcttatagag 
atgtaggagt 
aatgtgaagc 



actctcagac 
tgaggccagc 
ccgatacatc 
gagcctgctg 
cttcccgcca 
ggtgcaggcc 
tcacctggag 
tcgccccagg 
ctggactgta 
tcagcccatg 
cattccagta 
ctcctacctc 
gctgaagatt 
ggactctatt 
ttctccttac 
atcttcctac 
cctcagtctg 
cctggatcag 
ccggctttcg 
tgtcctgagt 
gctggagaga 
tgatcagctc 
cttctacggg 
gctgagcaat 
tggtaccctc 
tgagttgggg 
cagaaccttc 
tgcacatatc 
cttgaagcaa 
gttcagtgag 
gataggcaga 
ttagaaatag 
taatccctgt 
aaaaaaaaaa 



gtgcgtggca acaagtgact 
ctaagtcgct tcaaaatgga 
agcatgagtg tgtggacaag 
aaggatgagg ccctggccat 
ctcttcatgg cagcctttga 
tggcccttca cctgcctccc 
accttcaaag ctgtgcttga 
aggtggaaac ttcaagtgct 
tggtctggaa acagggccag 
acaaagaagc gaaaagtaga 
gaggtgctcg tagacctgtt 
attgagaaag tgaagcgaaa 
tttgcaatgc ccatgcagga 
gaagatttgg aagtgacttg 
ctgggccaga tgattaatct 
atttccccgg agaaggaaga 
cagtgcctgc aggctctcta 
ttgctcaggc acgtgatgaa 
gaaggggatg tgatgcatct 
ctaagtgggg tcatgctgac 
gcctctgcca ccctccagga 
cttgccctcc tgccttccct 
aattccatct ccatatctgc 
ctgacccacg tgctgtatcc 
cacctggaga ggcttgccta 
cggcccagca tggtctggct 
tatgacccgg agcccatcct 
aaatgcttca ttctgcatac 
caaagcagcc acagtttcag 
gaaaaaacat tcagacaaat 
tgttgacttg aggagttaat 
aatctgaatt tctaaaggga 
gtagactgtt gtaaagaaac 
2148 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 



<210> 202 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006115 
<400> 202 

tggggagata catcttatag agttagaaat agaatctgaa tttctaaagg gagattctgg 



<210> 203 
<211> 1051 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_006332 
<400> 203 

ggaccgccgc ctggttaaag gcgcttattt 
tgcccctgct gcgatgaccc tgtcgccact 
gctggacgtc cccacggcgg cggtgcaggc 
gaatgggcca ccagttaact acaagacagg 
gtccaatgca ccgcttgtca atgtgaccct 



cccaggcagc cgctgcagtc gccacacctt 60 

tctgctgttc ctgccaccgc tgctgctgct 120 

gtcccctctg caagcgttag acttctttgg 180 

caatctatac ctgcgggggc ccctgaagaa 240 

ctactatgaa gcactgtgcg gtggctgccg 300 
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agccttcctg atccgggagc tcttcccaac atggctgttg gtcatggaga tcctcaatgt 360 

cacgctggtg ccctacggaa acgcacagga acaaaatgtc agtggcaggt gggagttcaa 420 

gtgccagcat ggagaagagg agtgcaaatt caacaaggtg gaggcctgcg tgttggatga 480 

acttgacatg gagctagcct tcctgaccat tgtctgcatg gaagagtttg aggacatgga 54 0 

gagaagtctg ccactatgcc tgcagctcta cgccccaggg ctgtcgccag acactatcat 600 

ggagtgtgca atgggggacc gcggcatgca gctcatgcac gccaacgccc agcggacaga 660 

tgctctccag ccaccacacg agtatgtgcc ctgggtcacc gtcaatggga aacccttgga 720 

agatcagacc cagctcctta cccttgtctg ccagttgtac cagggcaaga agccggatgt 780 

ctgcccttcc tcaaccagct ccctcaggag tgtttgcttc aagtgatggc cggtgagctg 840 

cggagagctc atggaaggcg agtgggaacc cggctgcctg cctttttttc tgatccagac 900 

cctcggcacc tgctacttac caactggaaa attttatgca tcccatgaag cccagataca 960 

caaaattcca ccccatgatc aagaatcctg ctccactaag aatggtgcta aagtaaaact 1020 
agtttaataa gcaaaaaaaa aaaaaaaaaa a 1051 

<210> 204 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006332 
<400> 204 

aaattccacc cctagatcaa gaatcctgct ccactaagaa tggtgctaaa gtaaaactag 60 

<210> 205 
<211> 1714 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_006417 
<400> 205 

ggggcatttt gtgcctgcct agctatccag acagagcagc taccctcagc tctagctgat 60 

actacagaca gtacaacaga tcaagaagta tggcagtgac aactcgtttg acacggttgc 120 

acgaaaagat cctgcaaaat cattttggag ggaagcggct tagccttctc tataagggta 180 

gtgtccatgg attccgtaat ggagttttgc ttgacagatg ttgtaatcaa gggcctactc 240 

taacagtgat ttatagtgaa gatcatatta ttggagcata tgcggaagag agttaccagg 300 

aaggaaagta tgcttccatc atcctttttg cacttcaaga tactaaaatt tcagaatgga 360 

aactaggact atgtacacca gaaacactgt tttgttgtga tgttacaaaa tataactccc 420 

caactaattt ccagatagat ggaagaaata gaaaagtgat tatggactta aagacaatgg 480 

aaaatcttgg acttgctcaa aattgtacta tctctattca ggattatgaa gtttttcgat 540 

gcgaagattc actggatgaa agaaagataa aaggggtcat tgagctcagg aagagcttac 600 

tgtctgcctt gagaacttat gaaccatatg gatccctggt tcaacaaata cgaattctgc 660 

tgctgggtcc aattggagct gggaagtcca gctttttcaa ctcagtgagg tctgttttcc 720 

aagggcatgt aacgcatcag gctttggtgg gcactaatac aactgggata tctgagaagt 780 

ataggacata ctctattaga gacgggaaag atggcaaata cctgccgttt attctgtgtg 840 

actcactggg gctgagtgag aaagaaggcg gcctgtgcag ggatgacata ttctatatct 900 

tgaacggtaa cattcgtgat agataccagt ttaatcccat ggaatcaatc aaattaaatc 960 

atcatgacta cattgattcc ccatcgctga aggacagaat tcattgtgtg gcatttgtat 1020 

ttgatgccag ctctattcaa tacttctcct ctcagatgat agtaaagatc aaaagaattc 1080 

gaagggagtt ggtaaacgct ggtgtggtac atgtggcttt gctcactcat gtggatagca 114 0 

tggatttgat tacaaaaggt gaccttatag aaatagagag atgtgagcct gtgaggtcca 1200 

agctagagga agtccaaaga aaacttggat ttgctctttc tgacatctcg gtggttagca 1260 

attattcctc tgagtgggag ctggaccctg taaaggatgt tctaattctt tctgctctga 1320 

gacgaatgct atgggctgca gatgacttct tagaggattt gccttttgag caaataggga 1380 

atctaaggga ggaaattatc aactgtgcac aaggaaaaaa atagatatgt gaaaggttca 1440 

cgtaaatttc ctcacatcac agaagattaa aattcagaaa ggagaaaaca cagaccaaag 1500 

agaagtatct aagaccaaag ggatgtgttt tattaatgtc taggatgaag aaatgcatag 1560 

aacattgtag tacttgtaaa taactagaaa taacatgatt tagtcataat tgtgaaaaat 1620 
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agtaataatt tttcttggat ttatgttctg tatctgtgaa aaaataaatt tcttataaaa 1680 
ctcggaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 1714 

<210> 206 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_006417 
<400> 206 

atgacatatt ctatatcttg aacggtaaca ttcgtgatag ataccagttt aatcccatgg 60 

<210> 207 
<211> 3791 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_006461 
<400> 207 

•acagacggcg ggtgaacatg gcgtcctcga cttggtctga gacgtgatag gcctgccttc 
tggttgaaga tgtggcgagt gaaaaaactg agcctcagcc tgtcgccttc gccccagacg 
ggaaaaccat ctatgagaac tcctctccgt gaacttaccc tgcagcccgg tgccctcacc 
acctctggaa aaagatcccc cgcttgctcc tcgctgaccc catcactgtg caagctgggg 
ctgcaggaag gcagcaacaa ctcgtctcca gtggattttg taaataacaa gaggacagac 
ttatcttcag aacatttcag tcattcotca aagtggctag aaacttgtca gcatgaatca 
gatgagcagc ctctagatcc aattccccaa attagctcta ctcctaaaac gtctgaggaa 
gcagtagacc cactgggcaa ttatatggtt aaaaccatcg tccttgtacc atctccactg 
gggcagcaac aagacatgat atttgaggcc cgtttagata ccatggcaga gacaaacagc 
atatctttaa atggaccttt gagaacagac gatctggtga gagaggaggt ggcaccctgc 
atgggagaca ggttttcaga agttgctgct gtatctgaga aacctatctt tcaggaatct 
ccgtcccatc tcttagagga gtctccacca aatccctgtt ctgaacaact acattgctcc 
aaggaaagcc tgagcagtag aactgaggct gtgcgtgagg acttagtacc ttctgaaagt 
aacgccttct tgccttcctc tgttctctgg ctttcccctt caactgcctt ggcagcagat 
ttccgtgtca atcatgtgga cccagaggag gaaattgtag agcatggagc tatggaggaa 
agagaaatga ggtttcccac acatcctaag gagtctgaaa cagaagatca agcacttgtc 
tcaagtgtgg aagatattct gtccacatgc ctgacaccaa atctagtaga aatggaatcc 
caagaagctc caggcccagc agtagaagat gttggtagga ttcttggctc tgatacagag 
tcttggatgt ccccactggc ctggctggaa aaaggtgtaa atacctccgt catgctggaa 
aatctccgcc aaagcttatc ccttccctcg atgcttcggg atgctgcaat tggcactacc 
cctttctcta cttgctcggt ggggacttgg tttactcctt cagcaccaca ggaaaagagt 
acaaacacat cccagacagg cctggttggc accaagcaca gtacttctga gacagagcag 
ctcctgtgtg gccggcctcc agatctgact. gccttgtctc gacatgactt ggaagataac 
ctgctgagct ctcttgtcat tgtggagttt ctctcccgcc agcttcggga ctggaagagc 
cagctggctg tccctcaccc agaaacccag gacagtagca cacagactga cacatctcac 
agtgggataa ctaataaact tcagcatctt aaggagagcc atgagatggg acaggcccta 
cagcaggcca gaaatgtcat gcaatcatgg gtgcttatct ctaaagagct gatatccttg 
cttcacctat ccctgttgca tttagaagaa gataagacta ctgtgaatca ggagtctcgg 
cgtgcagaaa cattggtctg ttgctgtttt gatttgctga agaaattgag ggcaaagctc 
cagagcctca aagcagaaag ggaggaggca aggcacagag aggaaatggc tctcagaggc 
aaggatgcgg cagagatagt gttggaggct ttctgtgcac acgccagcca gcgcatcagc 
cagctggaac aggacctagc atccatgcgg gaattcagag gccttctgaa ggatgcccag 
acccaactgg tagggcttca tgccaagcaa gaagagctgg ttcagcagac agtgagtctt 
acttctacct tgcaacaaga ctggaggtcc atgcaactgg attatacaac atggacagct 
ttgctgagtc ggtcccgaca actcacagag aaactcacag tcaagagcca gcaagccctg 
caggaacgtg atgtggcaat tgaggaaaag caggaggttt ctagggtgct ggaacaagtc 
tctgcccagt tagaggagtg caaaggccaa acagaacaac tggagttgga aaacattcgt 
ctagcaacag atctccgggc tcagttgcag attctggcca acatggacag ccagctaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
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gagctacaga 


gtcagcatac 


ccattgtgcc 


caggacctgg 


ctatgaagga 


tgagttactc 


2340 


tgccagctta 


cccagagcaa 


tgaggagcag 


gctgctcaat 


gcgtaaagga 


agagatggca 


2400 


ctaaaacaca 


tgcaggcaga 


actgcagcag 


caacaagctg 




agaggtgcgg 




gacctgaaag 


agaccttgga 


gtttgcagac 


caggagaatc 


aggttgctca 


cc ^ g ^ 9ctg 




ggtcaggttg 


agtgtcaatt 


gaaaaccaca 




tccgggagcg 


cagc 9 ca y" 




tgtgagaacc 


tcaaggacac 


tgtagagaac 


ctaacggcta 




caeca agca 


2640 


gataaccagg 


agcaagatct 


ggagaaaaca 




ctca a ct 


ggg g g 




actgagcaac 


tacagagcct 


gactctcttt 


ctacagacaa 




gaagactgaa 




caagagaccc 


ttctgctgag 


tacagcctgt 


cctcccaccc 


aoaa 3 cacc a 
yy 


tetgectaat 


2820 


gacaggacct 


tcctgggaag 


catcttgaca 


gcagtggcag 


gaagagcc 


agaatcaact 




cctgtgccct 


tgcttggaag 






gag agca c 


aatggtttcc 


oa n 


cttcagcccg 


cagagacccc 


aggcatggag 


oa a 3 cc 
rata t 


cagaaa gag 


tattatgact 


^nnn 


actgagcttc 


agagtctttg 


ttccctgcta 






ca caggac 




ctgcagcgaa 


aaatttgtga 


gctgcaagct 


art- 


ccca 393 ^ 90 


a ? ag ff ° ag 


on 


gaagtccaga 


aggcaaaaga 


agcagacata 


era a t 
y y y g 


acca^^ctt 


g g g gc 




tacaagaatg 


aaaaggagct 


ccaggaagtg 




t 






atagacaaga 


gtggcgagct 


cataagcctt 


agagaggagg 


tgacccacct 


tacccgctca 


3300 


cttcggcgtg 


cggagacaga 


gaccaaagtg 


ctccaggagg 


cctggcaggc 


cagctggact 


3360 


ccaactgcca 


gcctatggcc 


accaattgga 


tccaggagaa 


agtgtggctc 


tctcaggagg 


3420 


tggacaaact 


gagagtgatg 


ttcctggaga 


tgaaaaatga 


gaaggaaaac 


tcctgatcaa 


3480 






atcctagagg 


agaaccttcg 


gcgctctgac 


aaggagttag 


3540 


aaaaactaga 


tgacattgtt 


cagcatattt 


ataagaccct 


gctctctatt 


ccagaggtgg 


3600 


tgaggggatg 


caaagaacta 


cagggattgc 


tggaatttct 


gagctaagaa 


actgaaagee 


3660 


agaatttgtt 


tcacctcttt 


ttacctgcaa 


taccccctta 


ccccaataoc 


aagaccaact 


3720 


ggcatagagc 


caactgagat 


aaatgctatt 


taaataaagt 


gtatttaatg 


aaaaaaaaaa 


3780 


aaaaaaaaaa 


a 3791 












<210> 208 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_006461 












<400> 208 














ctgacaagga 


gttagaaaaa 


ctagatgaca 


ttgttcagca 


tatttataag 


accctgctct 


60 


<210> 209 














<211> 2856 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_006516 












<400> 209 














tagtcgcggg 


tccccgagtg 




ggagcaggag 


accaaacgac 


gggggtcgga 


60 


gtcagagtcg 


cagtgggagt 


ccccggaccg 




ctgagcggga 


gagcgccgct 


i o! 


cgcacgcccg 


tcgccacccg 


cgtacccggc 


acaacc 


ccaccagcgc 


agcgctgcca 


0 


tggagcccag 


cagcaagaag 






ggctgtggga 


ggagcagtgc 




ttggctccct 


gcagtttggc 


tacaacactg 


gagtcatcaa 




aaggtgatcg 


300 


aggagttcta 


caaccagaca 


tgggtccacc 


gctatgggga 


gagcatcctg 


cccaccacgc 


360 


tcaccacgct 


ctggtccctc 


tcagtggcca 


tcttttctgt 


tgggggcatg 


attggctcct 


420 


tctctgtggg 


ccttttcgtt 


aaccgctttg 


gccggcggaa 


ttcaatgctg 


atgatgaacc 


480 


tgctggcctt 


cgtgtccgcc 


gtgctcatgg 


gcttctcgaa 


actgggcaag 


tcctttgaga 


540 


tgctgatcct 


gggccgcttc 


atcatcggtg 


tgtactgcgg 


cctgaccaca 


ggcttcgtgc 


600 


ccatgtatgt 


gggtgaagtg 


tcacccacag 


cctttcgtgg 


ggccctgggc 


accctgcacc 


660 


agctgggcat 


cgtcgtcggc 


atcctcatcg 


cccaggtgtt 


cggcctggac 


tccatcatgg 


720 


gcaacaagga 


cctgtggccc 


ctgctgctga 


gcatcatctt 


catcccggcc 


ctgctgcagt 


780 


gcatcgtgct 


gcccttctgc 


cccgagagtc 


cccgcttcct 


gctcatcaac 


cgcaacgagg 


840 
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agaaccgggc 




a tc ^a^a 


tgcgcgggac 


agctgacgtg 


acccatgacc 




tgcaggagat 




g gg g 


tgatgcggga 


gaagaaggtc 


accatcctgg 




agctgttccg 


ctcccccgcc 




? Cat ^ tCat 


cgctgtggtg 


ctgcagctgt 


i non 


cccagcagct 


gtctggcatc 


tatarr C 


tt t ° 


cacgagcatc 


ttcgagaagg 


i nnn 




gcagcctgtg 




ggc ccgg 


a eg caac 


acggccttca 




ctgtcgtgtc 


gctgtttgtg 


crtaa 


yg gg g 


g c c gcac 


c ca aggee 


i 9nn 


tcgctggcat 


y y yyy L y u 


gccatactca 




9 ^ a 9 cac g 


c ggagcagc 








taatt^t" 


t t*-t 

cca^c gg 


ctttgtggcc 


ttctttgaag 




tgggfccctgg 


ccccatccca 


. . g 


? . 




ggtccacgtc 




cagctgccat 


tgccgttgca 


g9C ° C< ~ a 


ac ggacc c 


aaatttcatt 


gtgggcatgt 




gcttccagta 




, 99 c 


acg c 


catcatcttc 


actgtgctcc 


1500 


tggttctgtt 


cttcatcttc 


cc ac ca 


aagttcctga 


gactaaaggc 


cggaccttcg 


1560 


atgagatcgc 


ttcc cttc 


cggcaggggg 


gagccagcca 


aagtgataag 


acacccgagg 


1620 




cccc gggg 


gctgattccc 


aagtgtgagt 


cgccccagat 


caccagcccg 


1680 




agcagcccta 


aggatctctc 


aggagcacag 


gcagctggat 


gagacttcca 


1740 


aacctaacaa 




agccgggcct 


ggggctcctt 


tctccagcca 


gcaatgatgt 


1800 


at 

° Jf-ot tt 


tt t 
a caggac 




aggattttaa 


caaaagcaag 


actgttgctc 


1860 




agacaagcaa 


cagg a 


aattttttta 


ttactgattt 


tgttattttt 


1920 


9 


gagtctcctg 


gcccaca c 


ccaggcttca 


ccctgaatgg 


ttccatgcct 


1980 


gagggtggag 


actaagccct 


gtcgagacac 


ttgccttctt 


cacccagcta 


atctgtaggg 


2040 


ctggacctat 


gtcctaagga 


cacactaatc 


gaactatgaa 


ctacaaagct 


tctatcccag 


2100 


gaggtggcta 


ggccacccg 


ttctgctggc 


ctggatctcc 


ccactctagg 


ggtcaggctc 


2160 


cattaggatt 


tgccccttcc 


catctcttcc 


tacccaacca 


ctcaaattaa 


tctttcttta 


2220 


cctgagacca 


gttgggagca 


ctggagtgca 


gggaggagag 


gggaagggee 


agtctgggct 


2280 


gccgggttct 


agtctccttt 


gcactgaggg 


ccacactatt 


accatgagaa 


gagggectgt 


2340 


gggagcctgc 


aaactcactg 


ctcaagaaga 


catggagact 


cctgccctgt' 


tgtgtataga 


■2400 


tgcaagatat 


ttatatatat 


ttttggttgt 


caatattaaa 


tacagacact 


aagttatagt 


2460 


atatctggac 


aagccaactt 


gtaaatacac 




ctgttactta 


cctaaacaga 


2520 


tataaatggc 


tggtttttag 


aaacatggtt 


ttgaaatgct 


tgtggattga 


gggtaggagg 


2580 


tttggatggg 


agtgagacag 


aagtaagtgg 


ggttgcaacc 


actgeaaegg 


cttagacttc 


2640 


gactcaggat 


ccagtccctt 


acacgtacct 


ctcatcagtg 


tcctcttgct 


caaaaatctg 


2700 


tttgatccct 


gttacccaga 


gaatatatac 


attctttatc 


ttgacattca 


aggcatttct 


2760 


atcacatatt 


tgatagttgg 


tgttcaaaaa 


aacactagtt 


ttgtgccagc 


cgtgatgctc 


2820 


aggcttgaaa 


tcgcattatt 


ttgaatgtga 


agggaa 2856 




<210> 210 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_006516 












<400> 210 














aaacagatat 


aaatggctgg 


tttttagaaa 


catggttttg 


aaatgcttgt 


aaattaaaao 


60 


<210> 211 














<211> 576 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_006607 












<400> 211 














atggctactc 


tgatctacgt 


tgataaggaa 


attggagaac 


caggcacccg 


tgtggctgcc 


60 


aaggatgtgc 


tgaagctgga 


gtctagacct 


tcaatcaaag 


cattagatgg 


gatatctcaa 


120 


gttttaacac 


cacgttttgg 


caaaacatac 


gatgctccat 


cagccttacc 


taaagctacc 


180 


agaaaggctt 


tgggcactgt 


caacagagct 


acagaaaagt 


cagtaaagac 


caatggaccc 


240 


agaaaacaaa 


aacagccaag 


cttttctgcc 


aaaaagatga 


ccgagaagac 


tgttaaaaca 


300 


aaaagttctg 


ttcctgcctc 


agatgacgcc 


tatccagaaa 


tagaaaaatt 


ctttcccttc 


360 
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aatcttctag 


actttgagag 


ttttgacctg 


cctgaagagc 


gccagattgc 


acacctcccc 


420 


ttgagtggag 


tgcctctcat 


gatccttgat 


gaggagggag 


agcttgaaaa 


gctgtttcag 


480 


ctgggccccc cttcacctgt 




tctccaccat 


gggaatgcaa 


tctgtttgca 


540 


gtctccttca 


agcattctgt 


cgaccctgga 


tgttga 576 




<210> 212 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_006607 












<400> 212 














cgcctatcca gaaatagaaa 


aattctttcc 


cttcaatctt 


ctagactttg 


agagttttga 


60 


<210> 213 














<211> 2058 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_006820 












<400> 213 














gcacgaggaa 


gccacagatc 


tcttaagaac 


tttctgtctc 


caaaccgtgg 


c gc cga a 




aatcagacag 


aacagttaat 


cctcaattta 


agcctgatct 


aacccctaga 


ga a a 




gaacaatgga 


agtgacaaca 


agattgacat 


ggaatgatga 


aaatcatctg 






tggaaatgtt 


tctttgagtc 


ttctctataa 


gtctagtgtt 


catggaggta 






tatggttgaa 


agatgcagcc 


gtcagggatg 


tactataaca 


atggcttaca 


ttgattacaa 


•inn 


tatgattgta 


gcctttatgc 


ttggaaatta 


tattaattta 


cgtgaaagtt 


ctacagagcc 




aaatgattcc 


ctatggtttt 


cacttcaaaa 


gaaaaatgac 


accactgaaa 


tagaaacttt 


atn 


actcttaaat 


acagcaccaa 


aaattattga 


tgagcaactg 


gtgtgtcgtt 


tatcgaaaac 


480 


ggatattttc 


attatatgtc 




aatttatcta 


gataaaatga 


taacaagaaa 


540 


cttgaaacta 


aggttttatg 


gccaccgtca 


gtatttggaa 


tgtgaagttt 


ttcgagttga 


600 


aggaattaag 


gataacctag 




gaggataatt 


aaagccagag 


agcacagaaa 


660 


taggcttcta 


gcagacatca 


gagactatag 


gccctatgca 


gacttggttt 


cagaaattcg 


720 


tattcttttg 


gtgggtccag 


ttgggtctgg 


aaagtccagt 


tttttcaatt 


cagtcaagtc 


780 


tatttttcat 


ggccatgtga 




cgtagtgggg 


tctgatacca 


ccagcataac 


840 


cgagcggtat 


aggatatatt 


ctgttaaaga 


tggaaaaaat 


ggaaaatctc 


tgccatttat 




gttgtgtgac 


actatggggc 


tagatggggc 


agaaggagca 


ggactgtgca 


tggatgacat 


q^n 


tccccacatc 


ttaaaaggtt 


gtatgccaga 


cagatatcag 


tttaattccc 


gtaaaccaat 


i no 


tacacctgag 


cattctactt 


ttatcacctc 


tccatctctg 


aaggacagga 


gtgt 


1080 


ggcttatgtc 


ttagacatca 


actctattga 


caatctctac 


tctaaaatgt 


tggcaaaagt 


1140 


gaagcaagtt 


cacaaagaag 


tattaaactg 


tggtatagca 


tatgtggcct 


tgcttactaa 


1200 


agtggatgat 


tgcagtgagg 


ttcttcaaga 


caacttttta 


aacatgagta 






ttctcaaagc 


cgggtcatga 


atgtccataa 


aatgctaggc 


attcctattt 


ccaatatttt 


1370 


gatggttgga 


aattatgctt 


cagatttgga 


actggacccc 


atgaaggata 






ctctgcactg 


aggcagatgc 


tgcgggctgc 


agatgatttt 


ttagaagatt 




1440 


ggaaactggt 


gcaattgaga 


gagcgttaca 


gccctgcatt 


tgagataagt 


. . ^ 
tgccutgatt 




ctgacatttg 


gcccagcctg 


tactggtgtg 


ccgcaatgag 


agtcaatctc 


tattgacagc 


i ^«n 


ctgcttcaga 


ttttgctttt 


gttcgttttg 


ccttctgtcc 


ttggaacagt 


catatctcaa 


1620 


gttcaaaggc 


caaaacctga 


gaagcggtgg 


gctaagatag 


gtcctactgc 


aaaccacccc 


1680 


tccatatttc 


cgtaccattt 


acaattcagt 


ttctgtgaca 


tctttttaaa 


ccactggagg 


1740 


aaaaatgaga 


tattctctaa 


tttattcttc 


tataacactc 


tatatagagc 


tatgtgagta 


1800 


ctaatcacat 


tgaataatag 


ttataaaatt 


attgtataga 


catctgottc 


ttaaacagat 


1860 


tgtgagttct 


ttgagaaaca 


gcgtggattt 


tacttatctg 


tgtattcaca 


gagcttagca 


1920 


cagtgcctgg 


taatgagcaa 


gcatacttgc 


cattactttt 


ccttcccact 


ctctccaaca 


1980 


tcacattcac 


tttaaatttt 


tctgtatata 


gaaaggaaaa 


ctagcctggg 


caacatgatg 


2040 


aaaccccatc 


tccactgc 2058 
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<210> 214 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006820 
<400> 214 

tgagttcttt gagaaacagc gtggatttta cttatctgtg tattcacaga gcttagcaca 

<210> 215 
<211> 2825 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> NM_006845 



<400> 215 

gcgaaattga ggtttcttgg tattgcgcgt ttctcttcct tgctgactct ccgaatggcc 60 

atggactcgt cgcttcaggc ccgcctgttt cccggtctcg ctatcaagat ccaacgcagt 120 

aatggtttaa ttcacagtgc caatgtaagg actgtgaact tggagaaatc ctgtgtttca 180 

gtggaatggg cagaaggagg tgccacaaag ggcaaagaga ttgattttga tgatgtggct 240 

gcaataaacc cagaactctt acagcttctt cccttacatc cgaaggacaa tctgcccttg 300 

caggaaaatg taacaatcca gaaacaaaaa cggagatccg tcaactccaa aattcctgct 360 

ccaaaagaaa gtcttcgaag ccgctccact cgcatgtcca ctgtctcaga gcttcgcatc 420 

acggctcagg agaatgacat ggaggtggag ctgcctgcag ctgcaaactc ccgcaagcag 480 

ttttcagttc ctcctgcccc cactaggcct tcctgccctg cagtggctga aataccattg 540 

aggatggtca gcgaggagat ggaagagcaa gtccattcca tccgtggcag ctcttctgca 600 

aaccctgtga actcagttcg gaggaaatca tgtcttgtga aggaagtgga aaaaatgaag 660 

aacaagcgag aagagaagaa ggcccagaac tctgaaatga gaatgaagag agctcaggag 720 

tatgacagta gttttccaaa ctgggaattt gcccgaatga ttaaagaatt tcgggctact 780 

ttggaatgtc atccacttac tatgactgat cctatcgaag agcacagaat atgtgtctgt 840 

gttaggaaac gcccactgaa taagcaagaa ttggccaaga aagaaattga tgtgatttcc 900 

attcctagca agtgtctcct cttggtacat gaacccaagt tgaaagtgga cttaacaaag 960 

tatctggaga accaagcatt ctgctttgac tttgcatttg atgaaacagc ttcgaatgaa 1020 

gttgtctaca ggttcacagc aaggccactg gtacagacaa tctttgaagg tggaaaagca 1080 

acttgttttg catatggcca gacaggaagt ggcaagacac atactatggg cggagacctc 1140 

tctgggaaag cccagaatgc atccaaaggg atctatgcca tggcctcccg ggacgtcttc 1200 

ctcctgaaga atcaaccctg ctaccggaag ttgggcctgg aagtctatgt gacattcttc 1260 

gagatctaca atgggaagct gtttgacctg ctcaacaaga aggccaagct gcgcgtgctg 1320 

gaggacggca agcaacaggt gcaagtggtg gggctgcagg agcatctggt taactctgct 1380 

gatgatgtca tcaagatgct cgacatgggc agcgcctgca gaacctctgg gcagacattt 1440 

gccaactcca attcctcccg ctcccacgcg tgcttccaaa ttattcttcg agctaaaggg 1500 

agaatgcatg gcaagttctc tttggtagat ctggcaggga atgagcgagg cgcagacact 1560 

tccagtgctg accggcagac ccgcatggag ggcgcagaaa tcaacaagag tctcttagcc 1620 

ctgaaggagt gcatcagggc cctgggacag aacaaggctc acaccccgtt ccgtgagagc 1680 

aagctgacac aggtgctgag ggactccttc attggggaga actctaggac ttgcatgatt 1740 

gccacgatct caccaggcat aagctcctgt gaatatactt taaacaccct gagatatgca 1800 

gacagggtca aggagctgag cccccacagt gggcccagtg gagagcagtt gattcaaatg 1860 

gaaacagaag agatggaagc ctgctctaac ggggcgctga ttccaggcaa tttatccaag 1920 

gaagaggagg aactgtcttc ccagatgtcc agctttaacg aagccatgac tcagatcagg 1980 

gagctggagg agaaggctat ggaagagctc aaggagatca tacagcaagg accagactgg 2040 

cttgagctct ctgagatgac cgagcagcca gactatgacc tggagacctt tgtgaacaaa 2100 

gcggaatctg ctctggccca gcaagccaag catttctcag ccctgcgaga tgtcatcaag 2160 

gccttacgcc tggccatgca gctggaagag caggctagca gacaaataag cagcaagaaa 2220 

cggccccagt gacgactgca aataaaaatc tgtttggttt gacacccagc ctcttccctg 2280 

gccctcccca gagaactttg ggtacctggt gggtctaggc agggtctgag ctgggacagg 2340 

ttctggtaaa tgccaagtat gggggcatct gggcccaggg cagctgggga gggggtcaga 2400 
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gtgacatggg acactccttt tctgttcctc agttgtcgcc ctcacgagag gaaggagctc 24 60 

ttagttaccc ttttgtgttg cccttctttc catcaagggg aatgttctca gcatagagct 2520 

ttctccgcag catcctgcct gcgtggactg gctgctaatg gagagctccc tggggttgtc 2580 

ctggctctgg ggagagagac ggagccttta gtacagctat ctgctggctc taaaccttct 2640 

acgcctttgg gccgagcact gaatgtcttg tactttaaaa aaatgtttct gagacctctt 2700 

tctactttac tgtctcccta gagtcctaga ggatccctac tgttttctgt tttatgtgtt 27 60 

tatacattgt atgtaacaat aaagagaaaa aataaaaaaa aaaaaaaaaa aaaaaaaaaa 2820 
aaaaa 2825 

<210> 216 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_006845 
<400> 216 

aaatgtttct gagacctctt tctactttac tgtctcccta gagtcctaga ggatccctac 60 

<210> 217 

<211> 823 

<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_007019 



<400> 217 

aaacgcgggc gggcgggccc gcagtcctgc agttgcagtc gtgttctccg agttcctgtc 60 

tctctgccaa cgccgcccgg atggcttccc aaaaccgcga cccagccgcc actagcgtcg 120 

ccgccgcccg taaaggagct gagccgagcg ggggcgccgc ccggggtccg gtgggcaaaa 180 

ggctacagca ggagctgatg accctcatga tgtctggcga taaagggatt tctgccttcc 240 

ctgaatcaga caaccttttc aaatgggtag ggaccatcca tggagcagct ggaacagtat 300 

atgaagacct gaggtataag ctctcgctag agttccccag tggctaccct tacaatgcgc 360 

ccacagtgaa gttcctcacg ccctgctatc accccaacgt ggacacccag ggtaacatat 420 

gcctggacat cctgaaggaa aagtggtctg ccctgtatga tgtcaggacc attctgctct 480 

ccatccagag ccttctagga gaacccaaca ttgatagtcc cttgaacaca catgctgccg 54 0 

agctctggaa aaaccccaca gcttttaaga agtacctgca agaaacctac tcaaagcagg 600 

tcaccagcca ggagccctga cccaggctgc ccagcctgtc cttgtgtcgt ctttttaatt 660 

tttccttaga tggtctgtcc tttttgtgat ttctgtatag gactctttat cttgagctgt 720 

ggtatttttg ttttgttttt gtcttttaaa ttaagcctcg gttgagccct tgtatattaa 780 

ataaatgcat ttttgtcctt ttttagacaa aaaaaaaaaa aaa 823 

<210> 218 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_007019 
<400> 218 

tggaaaaacc ccacagcttt taagaagtac ctgcaagaaa cctactcaaa gcaggtcacc 60 

<210> 219 
<211> 2831 
<212> DNA 

<213> Homo sapiens 



<300> 
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<308> NM 007183 



<400> 219 
gaattccgga 
gtggacctgc 
gcgtgtgctc 
cggaggccga 
agctgggaca 
ccagaggcac 
agggcctgag 
cagcctcctg 
cccacaatgg 
ccatgcccac 
atgacacact 
gcctggtgtc 
agcgccaggc 
aggtttcccc 
gcagccaccg 
ctcgagcgcc 
acgtgcatgg 
ttgatgacat 
aggtgctggg 
aggcccgcag 
aagtgcagcg 
agctggccct 
atgatgagct 
tgaaggaccg 
tgtcgggggc 
acaacgccac 
tgcgggagtg 
cgggcaaatg 
accgcctcta 
gggacctggc 
ggctgcgcga 
ccaagggcct 
agcgctgcga 
cggcaggcga 
ttctgaaccc 
ctggcctcat 
- tggtgagcca 
aggtgctggt 
cccgagacct 
acagccccga 
agtacaacaa 
gcccataggt 
tcagctccag 
tcgctggggc 
atagctgggg 
tgtatggggt 
atcttgggat 
aaaaggaatt 



caggacgtga 
cgccatgcag 
cctggcgctg 
gcggctgcgg 
gcagccgcgg 
atccaggggg 
tggggacaag 
gtcctcccgc 
gggcagcgcc 
caggcccgtg 
ctccctgcgc 
tgagcagctg 
cagctccagc 
gagccggacc 
gagccgcggg 
atctgtgcgc 
gttcaacagc 
tgacctgccc 
agcggcctac 
ccttcaggcc 
ccatgccaca 
ggtggaggag 
tcgcaaaaat 
cctggccaga 
tgggggtccc 
cggcttcctc 
ccacgggctg 
cgaggacaag 
cgacgagatg 
gggggcgccg 
gctgcccctc 
cgagtggctg 
gctcaaccgg 
ccgcaggtgg 
cctgctagac 
ccgaaacctg 
cctgatcgag 
caacatcata 
gctgtatttt 
cagtgagaag 
gctccaccgt 
gaagccttct 
gctgcttggc 
ccctgtgtgc 
acttggcttc 
ggtgacccag 
agccagcact 
c 2831 



agatagttgg 
gacggtaact 
ccctctgacc 
gcagcccgcg 
cacaacgggg 
cagtaccaca 
acctcgggct 
tccgccgtgg 
tttggggccg 
tccttccatg 
tcgctgcggc 
gagcccgcgg 
tccagccggg 
atccgtgccc 
gtaggcgggg 
agcctcagcc 
tacggtagcc 
tcagcagtca 
atccagcaca 
gtgcctaggc 
ggtgccatgc 
aacgggatct 
gtcacaggga 
gacacgctgg 
cccctcatcc 
aggaacctca 
gtggacgccc 
agcgtggaga 
ccgccgtccg 
ccgggagagg 
gccgccgatg 
tggagccccc 
cacacgacgg 
gcgggggtgc 
cgtgtcagga 
tctcggaacg 
aagctgccag 
gctgtgctca 
gacggactcc 
tcctcccggg 
gactttcggg 
ggaggagaag 
agcccagcct 
atctttgagg 
cgcagggcag 
tcacattggc 
gggaataaag 



gtttggaggc 
tcctgctgtc 
tgcagctgga 
tccaggagca 
ccgctgagcc 
ccctgcaggc 
tccggcccat 
atctgagctg 
ctgggtacgg 
agcgcggtgg 
tggggcccgg 
ccacctccac 
caggggggct 
ctgccgtgcg 
cagtgccggg 
tcagcctggc 
accgaaccct 
agtacctcat 
agtgctacag 
tggtgaagct 
gcaacctcat 
tcgagctgct 
tcctgtggaa 
agcagctcac 
agcagaacgc 
gctcagcctc 
tggtcacctc 
acgcggtgtg 
cgctgcagcg 
tcgtgggctg 
cgctcacctt 
agatcgtggg 
aggcggccgc 
tgagccgcct 
ccgccgacca 
ctaggaacaa 
gcagcgtggg 
acaacctggt 
gaaagctcat 
cagcatccag 
cgaagggcta 
gtgacgtggc 
ggaggagaag 
gtcctgggcc 
ggggtggggc 
agaggtgggg 
atggccatga 



ggccgccagg 
ggccctgcag 
ccgccggggc 
ggtccgcgcc 
cgagcctgag 
tggcttcagc 
cgccaagccg 
cagtcggagg 
gggtgcccag 
ggttgggagc 
gggcctggac 
ctacagggcc 
ggactggccc 
gaccctgcag 
ggccgtcctg 
tgactcgggc 
gcagagactc 
ggcttcagac 
cgatgcagcc 
cttcaaccac 
ctacgacaac 
gcggacactg 
cctttcatcc 
ggacctggtg 
ctcggaggcg 
tcaggccact 
tatcaaccac 
cgtcctgcgg 
gctggagggt 
cttcacgccg 
cgcggaggtg 
gctgtacaac 
cggggcgctg 
ggccctggag 
ccaccagctg 
ggacgagatg 
tgagaagtcg 
ggtggccagc 
cttcatcaag 
cctcctggcc 
tcggaaggag 
ccagcgtcca 
gctaatgacg 
accaggaggg 
agggctcaag 
gttggctgtg 
acagtcacaa 



cccaggcccg 
cctgaggccg 
gccgaggggc 
cgcctcttgc 
gccgagactg 
tctcgctctc 
gcctacagcc 
ctgagttcag 
cccacccctc 
cgggccgact 
gaccgctaca 
tttgcgtacg 
gaggccactg 
cgattccaga 
gagccagtgg 
cacctgccgg 
agcagcggtt 
cccaacctgc 
gccaagaagc 
gccaaccagg 
gctgacaaca 
cgggagcagg 
agcgaccacc 
ttgagccccc 
gagatcttct 
cgccagaaga 
gccctggacg 
aacctgtcct 
cgcggccgca 
cagagccggc 
tccaaggacc 
cggctgctgc 
cagaacatca 
caggagcgta 
cgctcactga 
tccacgaagg 
cccccagccg 
cccatcgctg 
aagaagcggg 
aacctgtggc 
gacttcctgg 
agggacagac 
gaggggcccc 
gcagggtctt 
gctgctctgg 
gcctggcagt 
aaaaaaaaaa 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 



<210> 220 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM 007183 
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ctggcagtat cttgggatag ccagcactgg 

<210> 221 
<211> 2815 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_007267 
<400> 221 

aggaagcgga ggaaggtgaa gtaggaccga 

gagcaggatg ggggctccgg aggtggcgcc 

tagcgggcat tggccagaga catggcccag 

accccagggg accagggcca gggccccagc 

ttccagcagc tcatccagga gcagagccag 

cagagagagc gggaggtgac aggaagtagc 

cagagcacgg ccacactccg catcctggcc 

cgaggtgcca tcatctccca gtactacaac 

cggcccctgc tcgggaactt tgtccgctcc 

gagctggacc ccacggccct ggaggaggag 

cagagcctgg cagtggcaca gcgggaccac 

gagaaacgca gcctgcgaga gaagagcagg 

ggcagcggcg gggtctgctc ctgctgtggc 

cacagcctgg gcctggcgct gctctccgcc 

ctgaagcgca tcgggggcca gttcggctcc 

accctgctgg ctttcaatgc cctcctgctg 

caggtcgcct tcccacccgc cctgccgggc 

ctcacaggcg cgggttgctt cacccacacc 

acgctgaacc agccgtgtgg cagccccctg 

ggcctgccct acaacatgcc cctggcctac 

acctgcatca ccctggtgta cagcatggct 

agcacctctg gcatccacgc catcaccgtc 

aagcgggcct cccgcctcca gcaggacaat 

gagtggcagc tgcggcacag ccccaggagc 

ctggggcttg tgtggctgct gtgtctgggg 

gtcttctcgg agttcatgat ccagagtcca 

gtcctgcccc tggtggttgg cctcctcaac 

gccgccctgg agccgcatga ctccccggta 

ctcatcctca agctggccat cctggggaca 

ggcgtcctgc agggccagtg ctgggaggat 

gtgatggact tcgtcctcat gttgctggac 

atctccgaga agaagctgaa gaggaggcgg 

ctggagctga tttatgggca gactctgacc 

cccgccgtgc agatcatcaa gctgctgctc 

gccaactgcc aggcgccgcg ccggccctgg 

acgctgctct gcttccccgc cttcctgggc 

caggtgaagc cctcgagcac ctgcggcccc 

ggcagggtgt gggtgcgcca cctggaggcg 

gtgcaccggt acctgatgga aaacaccttc 

gccgtgatct acctcaacat ccaggtggtg 

aaggagcaga tcagcaatga gggtgaggac 

atctacgaga ggaaggagag ggaggagagg 

gcaccccctg ccctgctcac agatgaacag 

gggcccgccc agcaccctga gaccacactg 

ggacaaggaa gacagtttcg cctctcgaaa 

gggtgggtga atccggcttg ggcatcccca 

attctgatta aaggggtttt gcaaatggga 

<210> 222 
<211> 60 




PCT/US2006/00428U 



gaataaagat ggccatgaac agtcacaaaa 60 



attcctgtgc cgaagaggcc tgcagtggga 60 

caggctctga gctaccctag gtctgcagac 120 

ccactggcct tc'atcctcga tgtccctgag 180 

ccctatgatg aaagcgaagt gcacgactcc 240 

tgcacggccc aggaggggct ggagctgcag 300 

cagcagacac tctggcggcc cgagggcacc 360 

agcatgccca gccgcaccat tggccgcagc 420 

cgcacggtgc agcttcggtg caggagcagc 4 80 

gcctggccca gcctccgcct gtacgacctg 540 

gagaagcaga gcctcctggt gaaggagttc 600 

atgcttcgcg ggatgccctt aagcctggct 660 

accccgaggg ggaagtggag gggccagccg 720 

cggctcagat atgcctgcgt gctggccttg 780 

ctgcaggccc tgatgccgtg gcgctacgcc 840 

agcgtgctct cctacttcct ctttctcaag 900 

ctgctgctgg tggccttcat catgggccct 960 

cctgcccccg tctgcacagg cctggagctc 1020 

gtcatgtact acggccacta cagtaacgcc 1080 

gatggcagcc agtgcacacc cagggtgggt 1140 

ctctccactg tgggcgtgag cttctttatc 1200 

cactctttcg gggagagcta ccgggtgggc 1260 

ttctgctcct gggactacaa ggtgacgcag 1320 

attcgcaccc ggctgaagga gctgctggcc 1380 

gtgtgcggga ggctgcggca ggcggctgtg 1440 

accgcgctgg gctgcgccgt ggccgtccac 1500 

gaggctgctg gccaggaggc tgtgctgctg 1560 

ctgggggccc cctacctgtg ccgtgtcctg 1620 

ctggaggtgt acgtggccat ctgcaggaac 1680 

ctgtgctacc actggctggg ccgcagggtg 1740 

tttgtgggcc aggagctgta ccggttcctg 1800 

acgctttttg gggaactggt gtggaggatt 1860 

aagccggagt ttgacattgc ccggaatgtc 1920 

tggctggggg tgctcttctc gcccctcctc 1980 

gtcttctatg tcaagaagac cagccttctg 2040 

ctggcctcac acatgagcac cgtcttcctc 2100 

gccgctgtct tcctctgcta cgccgtctgg 2160 

ttccggaccc tggacaccat gtacgaggcc 2220 

gcaggcccca gggtctcctg gctgccctgg 2280 

tttgtcttcc tggtgtcagc cctgctgctg 2340 

cggggccagc gcaaggtcat ctgcctgctc 2400 

aaaatcttct taatcaacaa gcttcactcc 24 60 

agcagggttg ggacaaccga ggaggctgcg 2520 

gatgcctagg gggacggcga tgggcctcac 2580 

ttgcctccca gtgaccctgc tgggacacca 264 0 

gccgcagctg cgcctaggct ggagctggaa 2700 

atgaactctg ccctgcctgg gactctattt 27 60 
aaaaaaaaaa aaaaaaaaaa aaaaa 2815 
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<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_007267 
<400> 222 

ggtgaggaca aaatcttctt aatcaacaag cttcactcca tctacgagag gaaggagagg 

<210> 223 
<211> 1893 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_007274 

<400> 223 
atttaccgcc 
ccagacgctt 
tccccgcacc 
ccccacgccg 
gcggggctcg 
tcgcccccgg 
tttatggcgc 
ctgcagccgc 
acgccgtccg 
aatgtccacg 
cggcattgca 
accgacttcc 
tacacctcca 
ggtgccaaaa 
gtggacaagg 
ggccggaagc 
gacatcgtcc 
ttgatccacc 
accatgaagc 
atcgtcacag 
atcaccatct 
gtggacgccg 
ttcacctacg 
gagaccgagg 
gcgaagcgac 
ctcgagtagc 
aaaacccaat 
tgtactctct 
ctacacagtg 
gagcacgctg 
aaaaaaaaaa 
aaaaaaaaaa 



gcgcggagag 
ccctcggggc 
ggcgcggccc 
ccagaatcgt 
gggtggggcc 
cgcgcaggcc 
ggcccgggct 
ccgccgcatc 
ccatccagat 
gggggaccat 
acagccagaa 
tgtctcccat 
agcactctgt 
agctgaccaa 
tcctcgaggt 
ggtatgaagc 
agccagtcct 
tggtggggcc 
tcatggatga 
cttccgtgga 
cgggacgcat 
accctgttgt 
tgtcgctgag 
acgagaagaa 
agggccacgc 
catggcaacg 
tcacattgag 
cctgcaaacc 
ttgtcccgag 
tagggtatgg 
aaaaaaaaaa 
aaaaaaaaaa 



tgagggccca 
tgccaccggg 
aaccgccacc 
acgcccgcgc 
ggcgcggggg 
ccgcctcacc 
cattcattcc 
cgccgccgca 
ctgccggatc 
cctgaagatg 
cggggagcgc 
gtgcatcggt 
ggaggtgcag 
taaggccacc 
gcctcctgtt 
ccagaagctg 
caacccagag 
ttcagactgc 
ggtcgccggg 
cgccattaat 
gaccttcacg 
ggacagctct 
ccaggaaggc 
gcgctttgag 
ggagcctcag 
ggcccagtgt 
agctggtgtt 
tacacaccaa 
cgccgggagg 
gaagaaccca 
aaaaaaaaaa 



agtccgccct 
tcgggcgcgg 
gaaccttctg 
gagctctctg 
cggggtcggc 
agccccatcg 
gcgccgggcc 
gcccccagca 
atgcggccag 
atcgaggagg 
tgtgtggccg 
gaggtggcgc 
gtcaacgtga 
ctgtggtatg 
gtgtattccc 
gagcgcatgg 
ccgaacactg 
accctgcacg 
atcgtggctg 
tttcatgaca 
agcaataagt 
cagaagcgct 
aggtcgctgc 
gaaggcaaag 
ccctagactc 
ccagtcactt 
gtctgaagtt 
agctttattt 
cgttgggcag 
gcaccactaa 
aaaaaaaaaa 
aaa 1893 



gctccgccac 
ctgccgcggc 
gaagcggcgg 
cagccttggc 
gcggggaggc 
ctccacctct 
tgccagacac 
tgtcgggccc 
atgatgccaa 
caggcgccat 
ccctggctcg 
atgtcagcgc 
tgtccgaaaa 
tgcccctgtc 
ggcaggagca 
agaccaagtg 
tcagctacag 
gctttgtgca 
cacgccactg 
agatcagaaa 
ccatggagat 
accgggccgc 
ctgtgcccca 
ggcggtacct 
cctcctcctg 
agaagttccc 
ttcgtatcac 
atatcattcc 
aaaccctcgg 
taaagctgct 
aaaaaaaaaa 



ttaggccgcc 
tagcgggcct 
ctgcctgggc 
ggcctgggag 
cgcgttcgat 
gccctccccc 
ctgcgccctt 
agacgtcgag 
ogtggccggc 
catcagcacc 
tgtcgagcgc 
ggagatcacc 
catcctcaca 
gctgaagaat 
ggaggaggag 
gaggaacggg 
ccagtccagc 
cggaggtgtg 
caagaccaac 
aggctgcgtc 
cgaggtgttg 
cagtgccttc ' 
gctggtgccc 
gcagatgaag 
ccactggtgc 
cccttggcca 
agtgttaacc 
agtatcaatg 
gaatgcttcc 
gcttggctgg 
aaaaaaaaaa 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



<210> 224 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_007274 
<400> 224 

acctacacac caaagcttta tttatatcat tccagtatca atgctacaca gtgttgtccc 
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<210> 225 
<211> 4157 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_007315 
<400> 225 

agcggggcgg ggcgccagcg ctgccttttc tcctgccggg tagtttcgct ttcctgcgca 60 

gagtctgcgg aggggctcgg ctgcaccggg gggatcgcgc ctggcagacc ccagaccgag 120 

cagaggcgac ccagcgcgct cgggagaggc tgcaccgccg cgcccccgcc tagcccttcc 180 

ggatcctgcg cgcagaaaag tttcatttgc tgtatgccat cctcgagagc tgtctaggtt 24 0 

aacgttcgca ctctgtgtat ataacctcga cagtcttggc acctaacgtg ctgtgcgtag 300 

ctgctccttt ggttgaatcc ccaggccctt gttggggcac aaggtggcag gatgtctcag 360 

tggtacgaac ttcagcagct tgactcaaaa ttcctggagc aggttcacca gctttatgat 420 

gacagttttc ccatggaaat cagacagtac ctggcacagt ggttagaaaa gcaagactgg 480 

gagcacgctg ccaatgatgt ttcatttgcc accatccgtt ttcatgacct cctgtcacag 540 

ctggatgatc aatatagtcg cttttctttg gagaataact tcttgctaca gcataacata 600 

aggaaaagca agcgtaatct tcaggataat tttcaggaag acccaatcca gatgtctatg 660 

atcatttaca gctgtctgaa ggaagaaagg aaaattctgg aaaacgccca gagatttaat 720 

caggctcagt cggggaatat tcagagcaca gtgatgttag acaaacagaa agagcttgac 7 80 

agtaaagtca gaaatgtgaa ggacaaggtt atgtgtatag agcatgaaat caagagcctg 840 

gaagatttac aagatgaata tgacttcaaa tgcaaaacct tgcagaacag agaacacgag 900 

accaatggtg tggcaaagag tgatcagaaa caagaacagc tgttactcaa gaagatgtat 960 

ttaatgcttg acaataagag aaaggaagta gttcacaaaa taatagagtt gctgaatgtc 1020 

actgaactta cccagaatgc cctgattaat gatgaactag tggagtggaa gcggagacag 1080 

cagagcgcct gtattggggg gccgcccaat gcttgcttgg atcagctgca gaactggttc 1140 

actatagttg cggagagtct gcagc.aagtt cggcagcagc ttaaaaagtt ggaggaattg 1200 

gaaoagaaat acacctacga acatgaccct atcacaaaaa acaaacaagt gttatgggac 1260 

cgcaccttca gtcttttcca gcagctcatt cagagctcgt ttgtggtgga aagacagccc 1320 

tgcatgccaa cgcaccctca gaggccgctg gtcttgaaga caggggtcca gttcactgtg 1380 

aagttgagac tgttggtgaa attgcaagag ctgaattata atttgaaagt caaagtctta 1440 

tttgataaag atgtgaatga gagaaataca gtaaaaggat ttaggaagtt caacattttg 1500 

ggcacgcaca caaaagtgat gaacatggag gagtccacca atggcagtct ggcggctgaa 15 60 

tttcggcacc tgcaattgaa agaacagaaa aatgctggca ccagaacgaa tgagggtcct 1620 

ctcatcgtta ctgaagagct tcactccctt agttttgaaa cccaattgtg ccagcctggt 1680 

ttggtaattg acctcgagac gacctctctg cccgttgtgg tgatctccaa cgtcagccag 1740 

ctcccgagcg gttgggcctc catcctttgg tacaacatgc tggtggcgga acccaggaat 1800 

ctgtccttct tcctgactcc accatgtgca cgatgggctc agctttcaga agtgctgagt 18 60 

tggcagtttt cttctgtcac caaaagaggt ctcaatgtgg accagctgaa catgttggga 1920 

gagaagcttc ttggtcctaa cgccagcccc gatggtctca ttccgtggac gaggttttgt 1980 

aaggaaaata taaatgataa aaattttccc ttctggcttt ggattgaaag catcctagaa 2040 

ctcattaaaa aacacctgct ccctctctgg aatgatgggt gcatcatggg cttcatcagc 2100 

aaggagcgag agcgtgccct gttgaaggac cagcagccgg ggaccttcct gctgcggttc 2160 

agtgagagct cccgggaagg ggccatcaca ttcacatggg tggagcggtc ccagaacgga 2220 

ggcgaacctg acttccatgc ggttgaaccc tacacgaaga aagaactttc tgctgttact 2280 

ttccctgaca tcattcgcaa ttacaaagtc atggctgctg agaatattcc tgagaatccc 2340 

ctgaagtatc tgtatccaaa tattgacaaa gaccatgcct ttggaaagta ttactccagg 2400 

ccaaaggaag caccagagcc aatggaactt gatggcccta aaggaactgg atatatcaag 24 60 

actgagttga tttctgtgtc tgaagttcac ccttctagac ttcagaccac agacaacctg 2520 

ctccccatgt ctcctgagga gtttgacgag gtgtctcgga tagtgggctc tgtagaattc 2580 

gacagtatga tgaacacagt atagagcatg aatttttttc atcttctctg gcgacagttt 2640 

tccttctcat ctgtgattcc ctcctgctac tctgttcctt cacatcctgt gtttctaggg 2700 

aaatgaaaga aaggccagca aattcgctgc aacctgttga tagcaagtga atttttctct 27 60 

aactcagaaa catcagttac tctgaagggc atcatgcatc ttactgaagg taaaattgaa 2820 

aggcattctc tgaagagtgg gtttcacaag tgaaaaacat ccagatacac ccaaagtatc 2880 

aggacgagaa tgagggtcct ttgggaaagg agaagttaag caacatctag caaatgttat 2940 

gcataaagtc agtgcccaac tgttataggt tgttggataa atcagtggtt atttagggaa 3000 

ctgcttgacg taggaacggt aaatttctgt gggagaattc ttacatgttt tctttgcttt 3060 
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aagtgtaact 


ggcagttttc 


cattggttta 


cctgtgaaat 


agttcaaagc 


caagtttata 


3120 


tacaattata 


tcagtcctct 


ttcaaaggta 


gccatcatgg 


atctggtagg 




3180 


gtattttatt 


acatctttca 


cattggctat 


ttaaagacaa 


agacaaattc 


tgtttcttga 


3240 


gaagagaata 


ttagctttac 


tgtttgttat 


ggcttaatga 


cactagctaa 


tatcaataga 


3300 


aggatgtaca 


tttccaaatt 


cacaagttgt 


gtttgatatc 


caaagctgaa 




3360 


tttcatcttg 


gtcacataca 


attattttta 


cagttctccc 




ggctattcac 




aaccactcat 


tcaaaagttg 


aaattaacca 


tagatgtaga 


taaactcaca 


aatttaattc 


3480 


atgtttctta 


aatgggctac 


tttgtccttt 


ttgttattag 


ggtggtattt 


agtctattag 




ccacaaaatt 








aacttgaata 


atacaccaga 




gataatatga 


gaatcagatc 


atttcaaaac 


tcatttccta 


tgtaactgca 


ttgagaactg 


3660 


catatgtttc 


gctgatatat 


gtgtttttca 


catttgcgaa 


tggttccatt 


ctctctcctg 


3720 


tactttttcc 


agacactttt 


ttgagtggat 


gatgtttcgt 


gaagtatact 


gtatttttac 


3780 


ctttttcctt 


ccttatcact 


gacacaaaaa 


gtagattaag 


agatgggttt 


gacaaggttc 


3840 


ttccctttta 


catactgctg 


tctatgtggc 


tgtatcttgt 


ttttccacta 


ctgctaccac 


3900 


aactatatta 


tcatgcaaat 


gctgtattct 


tctttggtgg 


agataaagat 


ttcttgagtt 


3960 


ttgttttaaa 


attaaagcta 


aagtatctgt 


attgcattaa 


atataatatg 


cacacagtgc 


4020 


tttccgtggc 


actgcataca 


atctgaggcc 


tcctctctca 


gtttttatat 


agatggegag 


4080 


aacctaagtt 


tcagttgatt 


ttacaattga 


aatgactaaa 


aaacaaagaa 


gacaacatta 


4140 


aaacaatatt 


gtttcta 4157 








<210> 226 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_00731S 












<400> 226 














atcagatcat 


ttcaaaactc 


atttcctatg 


taactgcatt 


gagaactgca 


tatgtttcgc 


60 


<210> 227 














<211> 1696 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_009587 












<400> 227 














caaaggactt 


cctagtgggt 


gtgaaaggca 


gcggtggcca 


cagaggcggc 


ggagagatgg 


60 


ccttcagcgg 


ttcccaggct 


ccctacctga 


gtccagctgt 


ccccttttct 


gggactattc 


120 


aaggaggtct 


ccaggacgga 


cttcagatca 


ctgtcaatgg 


gaccgttctc 


agctccagtg 


180 


gaaccaggtt 


tgctgtgaac 


tttcagactg 


gcttcagtgg 


aaatgacatt 


gccttccact 




tcaaccctcg 


gtttgaagat 


ggagggtacg 


? 9 . g gC3a 


cacgaggcag 


aaeggaaget 




gggggcccga 


ggagaggaag 


acacacatgc 




9999^ tgccc 


tttgacctct 




gcttcctggt 


gcagagctca 


gatttcaagg 


to . CC ^ gaa 


cgggatcctc 


ttcgtgcagt 




acttccaccg 


cgtgcccttc 


caccgtgtgg 


acac atct 


eg caa ggc 


tctgtgcagc 


tan 


tgtcctacat 


cagcttccag 


aacccccgca 


g 


9 g c 


ttctccacgg 




tgccgttctc 


ccagcctgtc 


tgtttcccac 


ccaggcccag 


ggggegcaga 


caaaaac c 




ccggcgtgtg 


gcctgccaac 


ccggctccca 






acagtgeaga 


n 


gcgcccctgg 


acagatgttc 


tctactcccg 


ccatcccacc 


tatgatgtac 


ccccaccccg 


720 


cctatccgat 


gcctttcatc 


accaccattc 


tgggagggct 


gtacccatcc 


aagtccatcc 


780 


tcctgtcagg 


cactgtcctg 


cccagtgctc 


agaggttcca 


catcaacctg 


tgctctggga 


840 


accacatcgc 


cttccacctg 


aacccccgtt 


ttgatgagaa 


tgctgtggtc 


cgcaacaccc 


900 


agatcgacaa 


ctcctggggg 


tctgaggagc 


gaagtctgcc 


ccgaaaaatg 


cccttcgtcc 


960 


gtggccagag 


cttctcagtg 


tggatcttgt 


gtgaagctca 


ctgcctcaag 


gtggccgtgg 


1020 


atggtcagca 


cctgtttgaa 


tactaccatc 


gcctgaggaa 


cctgcccacc 


atcaacagac 


1080 


tggaagtggg 


gggcgacatc 


cagctgaccc 


atgtgcagac 


ataggegget 


tcctggccct 


1140 


ggggccgggg 


gctggggtgt 


ggggcagtct 


gggtcctctc 


atcatcccca 


cttcccaggc 


1200 


ccagcctttc 


caaccctgcc 


tgggatctgg 


gctttaatgc 


agaggecatg 


tccttgtctg 


1260 
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gtcctgcttc 


tggctacagc caccctggaa 


cggagaaggc 


agctgacggg 


gattgccttc 


1320 


ctcagccgca 


gcagcacctg 


gggctccagc 


tgctggaatc 


ctaocatccc 


aggaggcagg 


1380 




gagaggggag gagtgggcag 


tgaagatgaa 


gccccatgct 


cagtcccctc 


1440 


ccatccccca 


cgcagctcca 


ccccagtccc 


aagccaccag 


ctgtctgctc 


ctggtgggag 


1500 


gtggcctcct 


cagcccctcc tctctgacct 


ttaacctcac 


tctcaccttg 


caccgtgcac 


1560 


caacccttca 


cccctcctgg aaagcaggcc 


tgatggcttc 


ccactggcct 


ccaccacctg 


1620 


accagagtgt 


tctcttcaga 


ggactggctc 


ctttcccagt 


gtccttaaaa 


taaagaaatg 


1680 


aaaatgcttg 


ttggca 1696 








<210> 228 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_009587 












<400> 228 














cagaggactg 


gctcctttcc 


cagtgtcctt 


aaaataaaga 


aatgaaaatg 


cttgttggca 


60 


<210> 229 














<2U> 6552 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_012291 












<400> 229 














atgaggagct 


tcaaaagagt 


caactttggg 


actctgctaa 


gcagccagaa 


ggaggctgaa 


60 


gagttgctgc 


ccgacttgaa 


ggagttcctg 


tccaaccctc 


cagctggttt 


tcccagcagc 


120 


cgatctgatg 


ctgagaggag 


acaagcttgt 


gatgccatcc 


tgagggcttg 


caaccagcag 


180 


ctgactgcta 


agctagcttg 




ctggggagcc 


tgctggagct 


ggcagagctg 


240 


gcctgtgatg 


gctacttagt 




cagcgt cctc 


ccctctacct 


ggaacgaatt 


300 


ctctttgtct 


tactgcggaa 


tactacto 


caaggaagcc 


cagaggccac 


actccgcctt 


360 


gctcagcccc 


tccatgcctg 




tgctctcgcg 


aggctgctcc 


ccaggactat 


420 


gaggccgtgg 


ctcggggcag 


cttttctctg 


ctttggaagg 


gggcagaagc 


cctgttggaa 


480 


cggcgagctg 


catttgcagc 


tcggctgaag 


gccttgagct 


tcctagtact 


cttggaggat 


540 


gaaagtaccc 


cttgtgaggt 


tcctcacttt 


gcttctccaa 


cagcctgtcg 


agcggtagct 


600 


gcccatcagc 


tatttgatgc 




ggtctaaatg 


aagcagatgc 


tgatttccta 


660 


gatgacctgc 


tctccaggca 


cgtgatcaga 


gocttggtgg 


gtgagagagg 


gagctcttct 


720 


gggcttcttt 


ctccccagag 


ggccctctgc 


ctcttggagc 


tcaccttgga 


acactgccgt 


780 


cgcttttgct 


ggagccgcca 


ccatgacaaa 


gccatcagcg 


cagtggagaa 


ggctcacagt 


840 


tacctaagga 


acaccaatct 


agcccctagc 


cttcagctat 


gtcagctggg 


ggttaagctg 


900 


ctgcaggttg 


gggaggaagg 


acctcaggca 


gtggccaagc 


ttctgatcaa 


ggcatcagct 


960 


gtcctgagca 


agagtatgga 


ggcaccatca 


cccccacttc 


gggcattgta 


tgagagctgc 


1020 


cagttcttcc 


tttcaggcct 


ggaacgaggc 


accaagaggc 


gctatagact 


tgatgccatt 


1080 


ctgagcctct 


ttgcttttct 


tggagggtac 


tgctctcttc 


tgcagcagct 


gcgggatgat 


1140 


ggtgtgtatg 


ggggctcctc 


caagcaacag 


cagtcttttc 


ttcagatgta 


ctttcaggga 


1200 


cttcacctct 


acactgtggt 


ggtttatgac 


tttgcccaag 


gctgtcagat 


agttgatttg 


1260 


gctgacctga 


cccaactagt 


ggacagttgt 


aaatctaccg 


ttgtctggat 


gctggaggcc 


1320 


ttagagggcc 


tgtcgggcca 


agagctgacg 


gaccacatgg 


ggatgaccgc 


ttcttacacc 


1380 


agtaatttgg 


cctacagctt 


ctatagtcac 


aagctctatg 


ccgaggcctg 


tgccatctct 


1440 


gagccgctct 


gtcagcacct 


gggtttggtg 


aagccaggca 


cttatcccga 


ggtgcctcct 


1500 


gagaagttgc 


acaggtgctt 


ccggctacaa 


gtagagagtt 


tgaagaaact 


gggtaaacag 


1560 


gcccagggct 


gcaagatggt 


gattttgtgg 


ctggcagccc 


tgcaaccctg 


tagccctgaa 


1620 


cacatggctg 


agccagtcac 


tttctgggtt 


cgggtcaaga 


tggatgcggc 


cagggctgga 


1680 


gacaaggagc 


tacagctaaa 


gactctgcga 


gacagcctca 


gtggctggga 


cccggagacc 


1740 


ctggccctcc 


tgctgaggga 


ggagctgcag 


gcctacaagg 


cggtgcgggc 


cgacactgga 


1800 


caggaacgct 


tcaacatcat 


ctgtgacctc 


ctggagctga 


gccccgagga 


gacaccagcc 


1860 
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ggggcctggg 
tttacgcagc 
gactctgtga 
ttgctgtggc 
cggagagccc 
gataaactcc 
gctgctcagt 
gggcaggccc 
gccctctacc 
attgtctctg 
cagctcctcc 
tcgagcctga 
gatctgcttc 
ctgctgtctg 
cgtgtccagg 
cactccctgt 
gactcccata 
actcagaaag 
tggcaggttc 
ctgggtagtg 
ctgcagatac 
cgcaatgaca 
tgcacagagt 
aaggggaagc 
ctaagaggcc 
ccttctacaa 
tcccattcac 
ctgcgctggg 
ctggatctgc 
gctctccaag 
gagatcttgg 
gagagcctgc 
ctagagccct 
agctgctgta 
agtgtccctg 
caaaagttag 
agaggactgc 
cccttcacgg 
cccagggtac 
ttggaagacc 
gcttcccggg 
gccccaggta 

gtggcatcaa 
ggccctgaga 
agctttgaga 
gctccgggcc 
aagaagctgc 
cagctcccct 
ctgagtgttg 
tgccgcttcc 
accgagtctg 
agcaaggccc 
cttcaggaga 
gctttggaat 
ctgatcccca 
gtgggcaaca 
attcccactg 
cagaaggcac 
ctggcactgg 
tgctggaagg 



cacgagccac 
agaccaactg 
ggcctgaggc 
tttacatctg 
aggcccctgg 
aggaagatcg 
ccaaatgcct 
cagctgtacg 
agctggtggc 
agagactgaa 
tgaccctcgg 
agcatctcga 
gaagtcaact 
tgcttcggga 
tcctgcagct 
gggagcagct 
agctcctccg 
cagctgtgga 
tttcagaggt 
tgagtgaagc 
cacgccagtg 
ttgatctctg 
ttggtggggt 
agcaggccca 
ctgctctaga 
actcctcccc 
ccacctgtga 
tattggtcac 
tgcaggtcgt 
cttccctgaa 
ctcaagcata 
agaaggttct 
ggcgagccag 
ctacccaact 
gctcagagcc 
cctctgctcc 
cctgcacacc 
tgtttgagga 
aacagagagt 
ctgtctcagc 
gccgggggcg 
gtgcccctgg 

gacattgtga 
tcatgaggac 
tcctcagggg 
ctgaggcagc 
ccagcccatg 
cagcccccgt 
ctttccgggg 
tggccttgtg 
tctccatcac 
agaagcaccg 
tgcctggaga 
ctggccactt 
gtggggtgac 
ccctcctgct 
gccagaacaa 
agaaagagaa 
accacaggat 
ggctgctgct 



ccacctggta 
ctctgctctg 
ccaggccaga 
tactctggaa 
taacttggag 
tttcctatac 
ggaccaagcc 
gtgtctccag 
aaagcccatg 
ggaccactcg 
ctgtcccagc 
tcagactact 
ctactggact 
tcctgccctc 
ggtggcagct 
ctgtgcccaa 
aagcatcatc 
gacatcgttt 
gctgagctgc 
caaggccttt 
tgccctgttc 
tcagtcggac 
gactcagcac 
ggtcccctgt 
gctggtggcc 
agtcttgaaa 
ctgctcgctc 
ggcaggggtg 
gctgaagggc 
tcataaaaca 
cacactgttg 
acagtcaggg 
cctgctcttg 
ttttgcaagc 
ctctaagact 
cctgcgcctc 
taaaccccca 
agtctgccct 
ccagacgcgc 
tgaggcctgg 
agcaaggaag 
gaaccctggc 

ggagcggcgt 
catccctgag 
ctctgacggg 
ttctggagaa 
cccagacaag 
agccactggt 
cattagtcac 
cctgggccac 
ctgtcgccac 
aggatcactt 
tgtccccctg 
cccccagcct 
tgtgtgtgtg 
gacccggctg 
gcttcatctg 
cagcagctgt 
ggaggttctc 
gccgtccagt 



gaactggctc 
gatgctatcc 
gatcagcttc 
gccaaaatac 
gaatttgaag 
agtaacattg 
ctggccctgt 
cagacagcag 
caggctctgg 
aaggcagctg 
tatgcccagt 
gacacatacc 
caccagaagg 
cagaagtcct 
taccttagcc 
ggctggcaga 
ctcctgctga 
ttggactatg 
tcagagaagc 
tgcttggagg 
ctggtgctga 
ctgcagcagg 
ctggactctg 
cctccacagc 
actgtggcca 
accaagcccc 
tgcgccagcc 
aggctggcca 
tgtcctgaag 
cccccctcct 
gcactggagg 
ctgaagtttg 
atttgggccc 
tcctggggct 
cagggccaaa 
aataatacct 
gaccggatca 
acagagagca 
ctcaaggtga 
ctggcagagg 
ggcctgagcc 
ctgaatggca 

ccccagaggg 
gaagaactga 
gaagactcag 
tgggagctgc 
gagagtgaca 
ctttctaccc 
tgtcctccta 
cgggatcctt 
cagctgctca 
gaaatagcag 
gcccgcatcc 
gaaaaggaga 
ttggccctgg 
gaaaaggaca 
cgttcagtcc 
actgacaagc 
atcgcttccc 
gaggagcccg 



aggtgctctg 
gggaagccct 
tggacgataa 
aggaaggtat 
tcaatgacct 
ccttcaacct 
ggaaggagct 
cctcactgca 
aggtcctcct 
gctcctcctg 
tacacctgga 
tgctcctttc 
tgaccaaggg 
ccaaggcttg 
tcccgtcaaa 
cacctgagat 
tgggcagtga 
gtgaaaatct 
tggtctgcca 
ccctaaaact 
agggcgagct 
ttctgttctt 
tgaagaaggt 
tcccagagga 
aggagcctgg 
agcccatacc 
ctgtcctcac 
tgggccacca 
ccgctgagcg 
tggttccaag 
gcctgaacca 
tagcagcacg 
tcacaaaact 
ggcagccacc 
aacgttctgg 
ctcagaaagg 
ggcaagctgg 
agcctgaagt 
acttcagtga 
agcctaagag 
taaagacgga 
ggagccggag 

ccagtgacca 
ctgacaactg 
cctcaggtgg 
tgaggctgga 
aggaccttgg 
tggactccat 
gtgggotcta 
atgccactgc 
cccacctcca 
accagctgca 
agcgcctctt 
gtttccagga 
ccaccctcca 
gtcccccagt 
tgaatgagtt 
gagaatggtg 
tagagaagtc 
gccctgccca 



ctaccacgac 
gcagcttctg 
agcacaggcc 
cgagcgggat 
gaactatgaa 
ggctgcagat 
gcttacaaag 
gatcctagca 
gctgctacgg 
ccacatcacc 
agaggcagca 
cctgacctgt 
tgtctctctg 
gtacttgctg 
caacctctca 
agctctcata 
cattctctca 
ggtacaaaaa 
cctgggccgc 
tacaacaaag 
ggagctggcc 
gcttgagtct 
ccacctgcag 
ggagctcttc 
ccccatagca 
caacttcctg 
agcagtctgt 
agcccagggt 
cctcacccaa 
cctcttggat 
gccatcaaac 
gataccccac 
aggtggcctc 
attaataaaa 
acgagggcgc 
tctggaaggt 
ccctcatgtc 
acccoaggcc 
tgacagtgac 
acggggcact 
tgccgtggtt 
ggccaagaag 

ggccaggcct 
gagaaaaatg 
gaagactcca 
ttccagcaag 
tcctcggctc 
ctgtgactcc 
tgcccacctc 
tttccttgtc 
cagacagctc 
ggggctgagc 
ttccttcagg 
gcgcctggct 
gcccggaacc 
cagtgtgcag 
tgatgccatc 
gacagggcgg 
tgtgctgggc 
ggaggcctcc 



1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 

4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
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cgcctacagg 


agctgctaca 


ggactgtggc 


tggaaatatc 


ctgaccgcac 


tctgctgaaa 


5520 


atcatgctca 


gtggtgccgg 


tgccctcacc 


cctcaggaca 


ttcaggccct 


ggcctacggg 


5580 


ctgtgcccaa 


cccagccaga 


gcgagcccag 


gagctcctga 


atgaggcagt 


aggacgtcta 




cagggcctga 


cagtaccaag 


caatagccac 


cttgtcttgg 


tcctagacaa 


ggacttgcag 


5700 


aagctgccgt 


gggaaagcat 


gcccagcctc 


caagcactgc 


ctgtcacccg 


gctgccctcc 




ttccgcttcc 


tactcagcta 


ctccatcatc 


aaagagtatg 


gggcctcgcc 


agtgctgagt 


5820 


caaggggtgg 


atccacgaag 


taccttctat 


gtcctgaacc 


ctcacaataa 


cctgtcaagc 


5880 


acagaggagc 


aatttcgagc 


caatttcagc 


agtgaagctg 


gctggagagg 


agtggttggg 




gaggtgccaa 


gacctgaaca 


ggtgcaggaa 


gccctgacaa 


agcatgattt 


gtatatctat 


finon 


gcagggcatg 


gggctggtgc 


ccgcttcctt 






gctgagctgt 


6060 


ogggcagtgg 


ccctgctgtt 


tggctgtagc 


agtgcggccc 


tggctgtgca 


tggaaacctg 


6120 


gagggggctg 


gcatcgtgct 


caagtacatc 


atggctggtt 


gccccttgtt 


tctgggtaat 


6180 




tgactgaccg 


cgacattgac 


cgctacacgg 


aagctctgct 


gcaaggctgg 


6240 


cttggagcag 




cccccttctc 


tactatgtaa 


accaggcccg 


ccaagctccc 


6300 




atcttattgg 


ggcfcgcacct 


atagcctatg 


gcttgcctgt 


ctctctgcgg 


6360 


taaccccatg 


gagctgtctt 


attgatgcta 


gaagcctcat 


aactgttcta 


cctccaaggt 


6420 


tagatttaat 


ccttaggata 


actcttttaa 


agtgattttc 


cccagtgttt 


tatatgaaac 


6480 




gatttaacct 


cagtataata 


aagatacatc 


atttaaaccc 


tgaaaaaaaa 


6540 


aaaaaaaaaa 


aa 6552 










<210> 230 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_012291 












<400> 230 














agcctcataa 


ctgttctacc 


tccaaggtta 


gatttaatcc 


ttaggataac 


tcttttaaag 


60 



<210> 231 
<211> 6317 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_013261 
<400> 231 

tagtaagaca ggtgccttca gttcactctc 
tgctctgtgt cactgtggat tggagttgaa 
atggcgtggg acatgtgcaa ccaggactct 
gctctggttg gtgaagacca gcctctttgc 
ctagatgtga acgacttgga tacagacagc 
caatcagaaa taatatccaa tcagtacaac 
gatgaagaga atgaggcaaa cttgctagca 
gtggatgaag acggattgcc ctcatttgat 
aatgaggcta gtccttcctc catgcctgac 
ccgtctctac ttaagaagct cttactggca 
tgcagtggtc tcagtaccca gaaccatgca 
gcaattgtta agactgagaa ttcatggagc 
aagccacaaa gacgtccctg. ctcggagctt 
cctcacacca aacccacaga gaacagaaac 

aagtcccaca cacagtcgca gtcacaacac 
cctctgaccc cagagtcacc aaatgacccc 
gaacgcacct taagtgtgga actctctgga 
cctcataaag ccaaccaaga taaccctttt 
aagactgtgg tgccaccacc atcaaagaag 



agtaaggggc tggttgcctg catgagtgtg 60 

aaagcttgac tggcgtcatt caggagctgg 120 

gagtctgtat ggagtgacat cgagtgtgct 180 

ccagatcttc ctgaacttga tctttctgaa 240 

tttctgggtg gactcaagtg gtgcagtgac 300 

aatgagcctt caaacatatt tgagaagata 360 

gtcctcacag agacactaga cagtctccct 4 20 

gcgctgacag atggagacgt gaccactgac 4 80 

ggcacccctc caccccagga ggcagaagag 540 

ccagccaaca ctcagctaag ttataatgaa 600 

aatcacaatc acaggatcag aacaaaccct 660 

aataaagcga agagtatttg tcaacagcaa 720 

ctcaaatatc tgaccacaaa cgatgaccct 780 

agcagcagag acaaatgcac ctccaaaaag 840 

ttacaagcca aaccaacaac tttatctctt 900 

aagggttccc catttgagaa caagactatt 960 

actgcaggcc taactccacc caccactcct 1020 

agggcttctc caaagctgaa gtcctcttgc 1080 

cccaggtaca gtgagtcttc tggtacacaa 1140 
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ggcaataact 
tcctcagtcc 
tttggtgacc 
tcacaggagc 
gggcagattt 
agcaagcagg 
gagctgaaca 
accggtgaac 

ataaattcag 
ctgagctacc 
tcttctttta 



tattactatg 
agatcacgtt 
cagcacgaga 
agggccaagc 
gtcggtaaaa 
ggtgaaattg 
acctaccgtt 
tcaaacgaaa 
tatgcagacc 
gactctctgg 
gttccctagc 
ctaaagacta 
acaaaacaaa 
atgaacacag 
catgggtgtc 
gtatgtgtgt 
ggtacatgtg 
gcagcagttc 
aaaaggaaat 
agaaaactaa 
ctgatgtctg 
aagcaagctt 
catagaacta 
ccttttgcca 
ctaacatttt 
atgttgatct 
cagcggttct 
tctctgagat 
agatgttaaa 
tacagctatg 
gaatattcaa 
gaacttactg 
agtgcaacaa 
aagcatcttg 
ttccacattt 
aatgaaattt 
caatgggtaa 
aagtcaaaaa 
aaaatgctga 
tatctttgaa 
tttttacagt 
acatatttat 
aagccagtgg 
ttcatttggg 
tgacgtgacc 
agaaaaacct 
ttcagtaaca 



ccaccaagaa 
tcactggtgg 
atgactattg 
tccaagactc 
gttcttccac 
tctctccttg 
agcacttcgg 
tgagggacag 

gactagccat 
cttgggatgg 
actctccatg 
ggatgcgctc 
ccaggtcaag 
agtcaagcca 
caagatcgcc 
ggctgaagag 
aaagggagag 
tcagacctga 
aggagtgcac 
atacctgtga 
ctgactttga 
tagattcaaa 
attttgatag 
tgaggatgac 
ttgcaagtca 
acaacaacaa 
ctgctgaaga 
agctttgctt 
gtgtatgtat 
aggactgggg 
catgaagaca 
atatatatat 
caaccaacca 
ggcatcagcc 
tctctcataa 
atatcctgtc 
tggaatctgg 
gaagtttctg 
tccactgcaa 
ttctgaggag 
gtgttcagat 
tggagtattt 
cactgtaaat 
atgtaattat 
cttatatata 
gagcaaaatg 
tcaagaaata 
tcttgtcgct 
tctcatctaa 
ccattaagaa 
tctttttaaa 
ggtaataatt 
gccagtattt 
tatgcatcct 
cagttgcaga 
aaattttcta 
aaattcagct 
atggtgttgt 
ccattttgat 
tttggagtgt 



agggccggag 
acacgaggaa 
ccagtcaatt 
tagacaacta 
agattcagac 
cagcacaaga 
tcatcccagt 
tgatttcagt 

ggatggcctg 
cacgcaatcc 
tagagattct 
tcgttcaagg 
atcaaggtct 
ctacagacac 
ctacagccgt 
ggaagaatat 
gcagaggcag 
cacaacacgg 
agtaaatctg 
tgcttttgct 
gctgtacttt 
ctcagatgac 
tttactgaaa 
agagggatgg 
tacttaggaa 
tacaacaaga 
ggcaagagac 
ttcctggagt 
gtgtgtggtg 
gcacctgacc 
cgcttaaaac 
atatatatat 
accaaccaac 
tttgtactct 
cgtaatgatt 
tctctctctc 
gtgggagagg 
tagttcatcc 
atttcaaaag 
cggacaccgg 
agtgtaattg 
ttattttatg 
gcagccttct 
gaggtgaaac 
tttgattgta 
atgctttccg 
tcctagtccc 
tgtttttctt 
aatttttcta 
atgtttttat 
atgctgtttt 
atagtatgat 
ctttcccttg 
gtataaatac 
tagcctattt 
cctaggatgc 
tttgcagaag 
tcttgatgac 
catcaggatt 
gtattcaagt 



caatccgagt 
aggaagacca 
aattccaaaa 
gaaaataaag 
cagtgctacc 
aaacagctcc 
caagctgttt 
aatgaacaat 

tttgatgaca 
tattcattgt 
gtgtcaccac 
tccttttctc 
ccaggcagta 
cgcacgcacc 
cggcccaggt 
cgcagagagt 
aaggcaattg 
acagaactga 
cgggatgatg 
gctcttgaaa 
tgtggacgca 
tttgaccctg 
gaagctcaga 
cgaatacctc 
tttctcctac 
acaacaacaa 
agaatgatat 
ctcttggtga 
tgtgtgcttg 
agaatgcgca 
ctagaacttc 
atatatatat 
caaccacaaa 
gtttttttaa 
atatgacaat 
tctctctctc 
atactgcggg 
ttaatcctga 
ccttgtcaat 
gttacattac 
ctacattctc 
tatatactat 
tttcaaaact 
aattattgta 
aaaacaaaaa 
cacatccatc 
ctaaaggtat 
tgaagtttta 
gaagatatca 
tttctctatc 
gttttaattt 
ttttacaata 
gcagagtatg 
tgatatttca 
attataaatt 
atgacaattg 
cagtgtttct 
attgcttctg 
tcatctgagt 
ttctaaattg 



tgtatgcaca 
agcggcccag 
cagaaatact 
atgtctcctc 
tgagagagac 
aagaccagga 
ttgacgacga 
tctccaaact 

gcgaagatga 
tcaatgtgtc 
ccaaatcctt 
gacacaggtc 
gatcctcttc 
gaaattctcc 
atgacagcta 
atgagaagcg 
aagagcgccg 
gggaccgttt 
gagacagcta 
atggatacac 
agcaattttt 
cttccaccaa 
gaagcttgcg 
atgggacagc 
tttacactct 
caataacaac 
ccagtaagca 
tggagtgtgc 
gtttagggga 
agggcaaacc 
aaaatgttcg 
atatataaat 
ccaccctaaa 
gaaagtgcag 
cctgaagaaa 
tctctttttt 
caccagaatg 
cacccatgta 
ggtcaagcgt 
taatgagagt 
tgatgtagtt 
acaacaatgt 
gctaaatttt 
cactaacata 
gacagtgtgt 
ccttaggtga 
taaccacttc 
tacactggat 
tgattttatg 
aacagtagtt 
ttgtgatttt 
attaatgtgt 
acgatggtat 
ttcctttgtt 
atgagatgat 
tcaggttgga 
acttgcacta 
ctaaatttaa 
gtggagtccc 
agattcgatt 



actcagcaag 
tctgcggctg 
cattaatata 
tgattggcag 
tttggaggca 
aatccgagcc 
agcagacaag 
acctatgttt 

aagtgataaa 
tccttcttgt 
attttctcaa 
gtgttcccga 
aagatcctgc 
cttgtatgtg 
cgaggaatat 
agagtctgag 
tgtgatttat 
tgaagttttt 
tggtttcatt 
tttgcgcagg 
caagtctaac 
gagcaagtat 
caggtaacat 
gcgtccttcc 
ctgtacaaaa 
aatggtttac 
catgtttatt 
gtgtgtgcat 
agtatgtgtg 
atttcaaatg 
tattctattc 
taaaaaggaa 
atgacagccg 
aatcaacttg 
ccacaggttc 
ttttcttttt 
ctaaagtttc 
aatgtccaaa 
gcagcttgtt 
tgggtagaac 
aagtatttac 
tcttttttgt 
tcttaatcaa 
tttagaagct 
gtgtctgttg 
gcttcaatct 
tgcgatattt 
ttgttagggg 
taaagtctct 
ttgaaactag 
aatttgatac 
gtctgaagac 
ttatctgtat 
tactaaagag 
gaaaataata 
gtgtaagtgc 
gcatggcctc 
taaaaacttc 
tggaatggaa 
actgtttggc 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 

1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4 680 
4740 
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tgacatgact 


tttctggaag 


acatgataca 


cctactactc 


aattgttctt 


ttcctttctc 


4800 


tcgcccaaca 


cgatcttgta 


agatggattt 


cacccccagg 


ccaatgcagc 


taattttgat 


4860 


agctgcattc 


atttatcacc 


agcatattgt 


gttctgagtg 


aatccactgt 


ttgtcctgtc 


4920 


ggatgcttgc 


ttgatttttt 


ggcttcttat 


ttctaagtag 


atagaaagca 


ataaaaatac 


4980 


tatgaaatga 


aagaacttgt 


tcacaggttc 


tgcgttacaa 


cagtaacaca 


tctttaatcc 


5040 


gcctaattct 


tgttgttctg 


taggttaaat 


gcaggtattt 


taactgtgtg 


aacgccaaac 


5100 


taaagtttac 


agtctttctt 


tctgaatttt 


gagtatcttc 


tgttgtagaa 


taataataaa 


5160 


aagactatta 


agagcaataa 


attattttta 


agaaatcgag 


atttagtaaa 


tcctattatg 


5220 


tgttcaagga 


ccacatgtgt 


tctctatttt 


gcctttaaat 


ttttgtgaac 


caattttaaa 


5280 


tacattctcc 


tttttgccct 


ggattgttga 


catgagtgga 


atacttggtt 


tcttttctta 


5340 


cttatcaaaa 


gacagcacta 


cagatatcat 


attgaggatt 


aatttatccc 


ccctaccccc 


5400 


agcctgacaa 


atattgttac 


catgaagata 


gttttcctca 


atggacttca 






agaattagtg 


gagcttttgt 


atcttctgca 


gacactgtgg 


gtagcccatc 


aaaatgtaag 


5520 


ctgtgctcct 


ctcattttta 


tttttatttt 


tttgggagag 


aatatttcaa 






gcaccccatc 


atcactggag 


gcaaatttca 


gcatagatct 


gtaggatttt 


tagaagaccg 


5640 


tgggccattg 


ccttcatgcc 


gtggtaagta 


ccacatctac 


aattttggta 


accgaactgg 


5700 


tgctttagta 


atgtggattt 


ttttcttttt 


taaaagagat 


gtagcagaat 


aattcttcca 




gtgcaacaaa 


atcaattttt 


tgctaaacga 


ctccgagaac 


aacagttggg 


ctgtcaacat 


5820 


tcaaagcagc 


agagagggaa 


ctttgcacta 


ttggggtatg 


atgtttgggt 


cagttgataa 


5880 


aaggaaacct 


tttcatgcct 


ttagatgtga 


gcttccagta 


ggtaatgatt 


atgtgtcctt 


5940 


tcttgatggc 


tgtaatgaga 


acttcaatca 




agacctgatc 


tatagatgac 


6000 


ctagaatagc 


catgtactat 


aatgtgatga 


ttctaaattt 


gtacctatgt 


gacagacatt 




ttcaataatg 


tgaactgctg 


atttgatgga 




gatttgtagg 


tgaaagtgta 


6120 


atactgttgg 


ttgaactatg 


ctgaagaggg 


aaagtgagcg 


attagttgag 


cccttgccgg 


6180 


gccttttttc 


cacctgccaa 


ttctacatgt 


attgttgtgg 


ttttattcat 


tgtatgaaaa 


6240 


ttcctgtgat 


tttttttaaa 


tgtgcagtac 


acatcagcct 


cactgagcta 


ataaagggaa 


6300 


acgaatgttt 


caaatct 6317 








<210> 232 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_013261 












<400> 232 














ctgtagtcta 


agacctgatc 


tatagatacc 


tagaatagcc 


atgtactata 


atgtgatgat 


60 


<210> 233 














<211> 3237 














<212> DNA 














<213> Homo sapiens 












<300> 














<308> NM_013277 












<400> 233 














gcgaagtgaa 


gggtggccca 


ggtggggcca 


ggctgactga 


atgtatctcc 


tagctatgga 


60 


ctaaataata 


catgggggga 


aataaacaag 


tattcatgag 


ggtgaaaatg 






ggaaaattac 


aactattttc 


aattgacgtt 


gaataggatg 


agtcatggaa 


tttaagtgat 




ttactgaaga 


ttatactact 


ggtagataga 


agagctaaag 


aaagatggat 




240 


tgaatgtgcg 


gaatctgttt 


gagcagcttg 


tgcgccgggt 


ggagattctc 


agtgaaggaa 


300 


atgaagtcca 


atttatccag 


ttggcgaagg 


actttgagga 


tttccgtaaa 


aagtggcaga 


360 


ggactgacca 


tgagctgggg 


aaatacaagg 


atcttttgat 


gaaagcagag 


actgagcgaa 


420 


gtgctctgga 


tgttaagctg 


aagcatgcac 


gtaatcaggt 


ggatgtagag 


atcaaacgga 


480 


gacagagagc 


tgaggctgac 


tgcgaaaagc 


tggaacgaca 


gattcagctg 


attcgagaga 


540 


tgctcatgtg 


tgacacatct 


ggcagcattc 


aactaagcga 


ggagcaaaaa 


tcagctctgg 


600 . 


cttttctcaa 


cagaggccaa 


ccatccagca 


gcaatgctgg 


gaacaaaaga 


ctatcaacca 


660 


ttgatgaatc 


tggttccatt 


ttatcagata 


tcagctttga 


caagactgat 


gaatcactgg 


720 


attgggactc 


ttctttggtg 


aagactttca 


aactgaagaa 


gagagaaaag 


aggcgctcta 


780 
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ctagccgaca 
ctgcagtaga 
atggcgggcc 
gaaggaaaac 
tggagccaag 
tgcatgactt 
ggataaaatt 



gtttgttgat 
ccaggggaat 
catcgaagct 
aggtacttta 
aactgagaca 
tgtttctaag 
tggcaaatta 



ggtccccctg 
gaatccatag 
gtgtccacta 
caaccttgga 
gacagtgtgg 
acggttatta 
tctctgaagt 



gacctgtaaa 
ttgcaaaaac 
ttgagactgt 
acagtgactc 
gcacgccaca 
aacctgaatc 
gtcgagactg 



gaaaactcgt 
tacagtgact 
gccatattgg 
caccctgaac 
gagtaatgga 
ctgtgttcca 
tcgtgtggtc 



tccattggct 840 

gttcccaatg 900 

accaggagcc 960 

agcaggcagc 1020 

gggatgcgcc 1080 

tgtggaaagc 1140 

tctcatccag 1200 



aatgtcggga ccgctgtccc cttccctgca ttcctaccct gataggaaca cctgtcaaga 1260 



ttggagaggg 
ttgtgcattg 
tctctggctg 
tacccctcct 
ttcgaaacct 
cagaaatcac 
cccaggccaa 
gtccacatac 
cccatgctgt 
aggtggttga 
agcaagagaa 
agacaccaga 
tcaagactcc 
ctcctagatt 
ttgcttctcc 
tataagaaag 
gcattctcca 
tatctcccct 
ggagctttta 
ttctttttgg 
tgatctttct 
tatctgataa 
tagaccagag 
aactgagcta 
tgggagtatg 
ccttgtaaat 
ggttcatttg 
ccccataccc 
tggcatccca 
aagaaaagct 
atagccacat 
aaaaacgatt 
tatttgggat 



aatgctggca 
tgtaaatgag 
tgaccgcaca 
cagcaaagtg 
caaagaacct 
agatgaagac 
cagggacaca 
taaaatggat 
gcccaatcca 
gcgcctgctt 
cattgacccc 
tattaaagtg 
ttcatctagt 
tgggagcaaa 
aatgctcaag 
gacacatctg 
ggcttttact 
tccttctcct 
gatgagacat 
ggaagggggt 
ctagtttcat 
gattagcttg 
gatttaggat 
taatatggag 
cgtagctttg 
gtgaaagtac 
gggtcaaagc 
ccaaccttgt 
actaacaata 
acagtcattt 
ctagatagat 
ttcttctgta 
aatatttgga 



gactttgtgt 
attgagcaaa 
gtaaaagagc 
gatgatatcc 
cttctgacct 
aacagcatag 
ttagctttcc 
gttgccaatc 
gacccagtga 
tccttgcctc 
ctacatgtca 
agtttactgg 
tccctgtcac 
agcaagtctg 
tgaagtcaca 
tactctgctc 
caagtttaat 
caagtcacat 
ctttccaggg 
tattgttcot 
ttagaactga 
tctcagggtg 
gcctccttct 
ctttcataaa 
atttggatga 
aactcgtatt 
cagtttttct 
ccagtggagc 
aagagtaggc 
tttctttgca 
gtgagctttt 
gaatgtttga 
aaaaaagtaa 



cccagacttc 
gaggtctgac 
tgaaagagaa 
atgctatctg 
ttcgccttaa 
ctgccatgta 
tcatgattca 
tggctaaagt 
caatgttaca 
tggagtattg 
ttgaaaactc 
gacctgtgac 
agagagtccg 
ccactaacct 
tctgcctgtt 
tgcagcctcc 
tgtgcatgag 
aatatcagca 
gtagaagggt 
ttggcttaaa 
tttccgtgag 
ggaagtggga 
aagaaccaga 
aatgggatgc 
ttaggtcttt 
tatctctgat 
tttaaaattg 
ccaacttcta 
tataagggaa 
ctttggatgc 
tcttctgtta 
cttcgtattg 
atagcttttt 



tccaatgatc 
tgagacaggc 
attcctcaga 
tagccttcta 
cagagccttt 
ccaagctgtt 
cttgcagaga 
ctttggccct 
ggacatcaag 
gagtcagttc 
aaatgccttt 
cactcctgaa 
ttccaccctc 
aggacgacaa 
acttcccagc 
tgtactcatt 
ggttttatta 
ctttgtgctg 
tagtatggaa 
gccaaatgct 
acaatgacag 
gggcagggca 
agttctcatt 
attgaggaca 
aatagtgttg 
gtgccgctgg 
aattcattct 
aaggtcaata 
gattgtcaat 
tgaaattttt 
aaattattct 
acccttatct 
caaaatgaaa 



ccctccattg 1320 

ctgtatagga 1380 

gtgaaaactg 1440 

aaagactttc 1500 

atggaagcag 1560 

ggtgaactgc 1620 

gtggctcaga 1680 

acaatagtgg 1740 

cgtcaaccca 1800 

atgatggtgg 1860 

tcaacaccac 1920 

catcagcttc 1980 

accaagaaca 2040 

ggcaactttt 2100 

attgactgac 2160 

actactttta 2220 

aaactatata 2280 

gtcattgttg 2340 

ttggttgtga 2400 

gctcatagaa 2460 

aaaccctacc 2520 

aagaaaggat 2580 

ccccattatg 2640 

gaactagtga 2700 

agtggcacaa 2760 

ctgaactttg 2820 

gatgcttggc 2880 

tatcatcctt 2940 

attttgtggt 3000 

cccatggaac 3060 

taatgtctgt 3120 

gtaaaacacc 3180 
aaaaaaa 3237 



<210> 234 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM 013277 



<400> 234 

ctcattcccc attatgaact gagctataat atggagcttt cataaaaatg ggatgcattg 

<210> 235 
<211> 1122 
<212> DNA 
<213> Homo sapiens 
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<300> 

<308> NM_013409 
<400> 235 

gctcctcgcc ccgcgcctgc ccccaggatg gtccgcgcga ggcaccagcc gggtgggctt 60 

tgcctcctgc tgctgctgct ctgccagttc atggaggacc gcagtgccca ggctgggaac 120 

tgctggctcc gtcaagcgaa gaacggccgc tgccaggtcc tgtacaagac cgaactgagc 180 

aaggaggagt gctgcagcac cggccggctg agcacctcgt ggaccgagga ggacgtgaat 240 

gacaacacac tcttcaagtg gatgattttc aacgggggcg cccccaactg catcccctgt 300 

aaagaaacgt gtgagaacgt ggactgtgga cctgggaaaa aatgccgaat gaacaagaag 360 

aacaaacccc gctgcgtctg cgccccggat tgttccaaca tcacctggaa gggtccagtc 420 

tgcgggctgg atgggaaaac ctaccgcaat gaatgtgcac tcctaaaggc aagatgtaaa 480 

gagcagccag aactggaagt ccagtaccaa ggcagatgta aaaagacttg tcgggatgtt 540 

ttctgtccag gcagctccac atgtgtggtg gaccagacca ataatgccta ctgtgtgacc 600 

tgtaatcgga tttgcccaga gcctgcttcc tctgagcaat atctctgtgg gaatgatgga 660 

gtcacctact ccagtgcctg ccacctgaga aaggctacct gcctgctggg cagatctatt 720 

ggattagcct atgagggaaa gtgtatcaaa gcaaagtcct gtgaagatat ccagtgcact 780 

ggtgggaaaa aatgtttatg ggatttcaag gttgggagag gccggtgttc cctctgtgat 840 

gagctgtgcc ctgacagtaa gtcggatgag cctgtctgtg ccagtgacaa tgccacttat 900 

gccagcgagt gtgccatgaa ggaagctgcc tgctcctcag gtgtgctact ggaagtaaag 960 

cactccggat cttgcaactc catttcggaa gacaccgagg aagaggagga agatgaagac 1020 

caggactaca gctttcctat atcttctatt ctagagtggt aaactctcta taagtgttca 1080 
gtgttcacat agcctttgtg caaaaaaaaa aaaaaaaaaa aa 1122 

<210> 236 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_013409 
<400> 236 

gaagatgaag accaggacta cagctttcct atatcttcta ttctagagtg gtaaactctc 60 

<210> 237 

<211> 11389 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_014246 
<400> 237 

atggcgccgc cgccgccgcc cgtgctgccc gtgctgctgc tcctggccgc cgccgccgcc 60 

ctgccggcga tggggctgcg agcggccgcc tgggagcogc gcgtacocgg cgggacccgc 120 

gccttcgccc tccggcccgg ctgtacctac gcggtgggcg ccgcttgcac gccccgggcg 180 

ccgcgggagc tgctggacgt gggccgcgat gggcggctgg caggacgtcg gcgcgtctcg 240 
ggcgcggggc gcccgctgcc gctgcaagtc cgcttggtgg cccgcagtgc cccgacggcg 300 
ctgagccgcc gcctgcgggc gcgcacgcac cttcccggct gcggagcccg tgcccggctc 360 
tgcggaaccg gtgcccggct ctgcggggcg ctctgcttcc ccgtccccgg cggctgcgcg 420 
gccgcgcagc attcggcgct cgcagctccg accaccttac ccgcctgccg ctgcccgccg 480 
cgccccaggc cccgctgtcc cggccgtccc atctgcctgc cgccgggcgg ctcggtccgc 540 
ctgcgtctgc tgtgcgccct gcggcgcgcg gctggcgccg tccgggtggg actggcgctg 600 
gaggccgcca ccgcggggac gccctccgcg tcgccatccc catcgccgcc cctgccgccg 660 
aacttgcccg aagcccgggc ggggccggcg cgacgggccc ggcggggcac gagcggcaga 720 
gggagcctga agtttccgat gcccaactac caggtggcgt tgtttgagaa cgaaccggcg 780 
ggcaccctca tcctccagct gcacgcgcac tacaccatcg agggcgagga ggagcgcgtg 840 
agctattaca tggaggggct gttcgacgag cgctcccggg gctacttccg aatcgactct 900 
gccacgggcg ccgtgagcac ggacagcgta ctggaccgcg agaccaagga gacgcacgtc 960 
ctcagggtga aagccgtgga ctacagtacg ccgccgcgct cggccaccac ctacatcact 1020 
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gtcttggtca aagacaccaa cgaccacagc ccggtcttcg agcagtcgga gtaccgcgag 1080 

cgcgtgcggg agaacctgga ggtgggctac gaggtgctga ccatccgcgc cagcgaccgc 1140 

gactcgccca tcaacgccaa cttgcgttac cgcgtgttgg ggggcgcgtg ggacgtcttc 1200 

cagctcaacg agagctctgg cgtggtgagc acacgggcgg tgctggaccg ggaggaggcg 1260 

gccgagtacc agctcctggt ggaggccaac gaccaggggc gcaatccggg cccgctcagt 1320 

gccacggcca ccgtgtacat cgaggtggag gacgagaacg acaactaccc ccagttcagc 1380 

gagcagaact acgtggtcca ggtgcccgag gacgtggggc tcaacacggc tgtgctgcga 1440 

gtgcaggcca cggaccggga ccagggccag aacgcggcca ttcactacag catcctcagc 1500 

gggaacgtgg ccggccagtt ctacctgcac tcgctgagcg ggatcctgga tgtgatcaac 1560 

cccttggatt tcgaggatgt ccagaaatac tcgctgagca ttaaggccca ggatgggggc 1620 

cggcccccgc tcatcaattc ttcaggggtg gtgtctgtgc aggtgctgga tgtcaacgac 1680 

aacgagccta tctttgtgag cagccccttc caggccacgg tgctggagaa tgtgcccctg 1740 

ggctaccccg tggtgcacat tcaggcggtg gacgcggact ctggagagaa cgcccggctg 1800 

cactatcgcc tggtggacac ggcctccacc tttctggggg gcggcagcgc tgggcctaag 1860 

aatcctgccc ccacccctga cttccccttc cagatccaca acagctccgg ttggatcaca 1920 

gtgtgtgccg agctggaccg cgaggaggtg gagcactaca gcttcggggt ggaggcggtg 1980 

gaccacggct cgccccccat gagctcctcc accagcgtgt ccatcacggt gctggacgtg 2040 

aatgacaacg acccggtgtt cacgcagccc acctacgagc ttcgtctgaa tgaggatgcg 2100 

gccgtgggga gcagcgtgct gaccctgcag gcccgcgacc gtgacgccaa cagtgtgatt 2160 

acctaccagc tcacaggcgg caacacccgg aaccgctttg cactcagcag ccagagaggg 2220 

ggcggcctca tcaccctggc gctacctctg gactacaagc aggagcagca gtacgtgctg 2280 

gcggtgacag catccgacgg cacacggtcg cacactgcgc atgtcctaat caacgtcact 2340 

gatgccaaca cccacaggcc tgtctttcag agctcccatt acacagtgag tgtcagtgag 2400 

gacaggcctg tgggcacctc cattgctacc ctcagtgcca acgatgagga cacaggagag 2460 

aatgcccgca tcacctacgt gattcaggac cccgtgccgc agttccgcat tgaccccgac 2520 

agtggcacca tgtacaccat gatggagctg gactatgaga accaggtcgc ctacacgctg 2580 

accatcatgg cccaggacaa cggcatcccg cagaaatcag acaccaccac cctagagatc 2640 

ctcatcctcg atgccaatga caatgcaccc cagttcctgt gggatttcta ccagggttcc 2700 

atctttgagg atgctccacc ctcgaccagc atcctccagg tctctgccac ggaccgggac 27 60 

tcaggtccca atgggcgtct gctgtacacc ttccagggtg gggacgacgg cgatggggac 2820 

ttctacatcg agcccacgtc cggtgtgatt cgcacccagc gccggctgga ccgggagaat 2880 

gtggccgtgt acaacctttg ggctctggct gtggatcggg gcagtcccac tccccttagc 2940 

gcctcggtag aaatccaggt gaccatcttg gacattaatg acaatgcccc catgtttgag 3000 

aaggacgaac tggagctgtt tgttgaggag aacaacccag tggggtcggt ggtggcaaag 3060 

attcgtgcta acgaccctga tgaaggccct aatgcccaga tcatgtatca gattgtggaa 3120 

ggggacatgc ggcatttctt ccagctggac ctgctcaacg gggacctgcg tgccatggtg 3180 

gagctggact ttgaggtccg gcgggagtat gtgctggtgg tgcaggccac gtcggctccg 324 0 

ctggtgagcc gagccacggt gcacatcctt ctcgtggacc agaatgacaa cccgcctgtg 3300 

ctgcccgact tccagatcct cttcaacaac tatgtcacca acaagtccaa cagtttcccc 3360 

accggcgtga tcggctgcat cccggcccat gaccccgacg tgtcagacag cctcaactac 3420 

accttcgtgc agggcaacga gctgcgcctg ttgctgctgg accccgccac gggcgaactg 34 80 

cagctcagcc gcgacctgga caacaaccgg ccgctggagg cgctcatgga ggtgtctgtg 3540 

tctgatggca tccacagcgt cacggccttc tgcaccctgc gtgtcaccat catcacggac 3600 

gacatgctga ccaacagcat cactgtccgc ctggagaaca tgtcccagga gaagttcctg 3660 

tccccgctgc tggccctctt cgtggagggg gtggccgccg tgctgtccac caccaaggac 3720 

gacgtcttcg tcttcaacgt ccagaacgac accgacgtca gctccaacat cctgaacgtg 3780 

accttctcgg cgctgctgcc tggcggcgtc cgcggccagt tcttcccgtc ggaggacctg 384 0 

caggagcaga tctacctgaa tcggacgctg ctgaccacca tctccacgca gcgcgtgctg 3900 

cccttcgacg acaacatctg cctgcgcgag ccctgcgaga actacatgaa gtgcgtgtcc 3960 

gttctgcgat tcgacagctc cgcgcccttc ctcagctcca ccaccgtgct cttccggccc 4020 

atccacccca tcaacggcct gcgctgccgc tgcccgcccg gcttcaccgg cgactactgc 4 080 

gagacggaga tcgacctctg ctactccgac ccgtgcggcg ccaacggccg ctgccgcagc 4140 

cgcgagggcg gctacacctg cgagtgcttc gaggacttca ctggagagca ctgtgaggtg 4200 

gatgcccgct caggccgctg tgccaacggg gtgtgcaaga acgggggcac ctgcgtgaac 4260 

ctgctcatcg gcggcttcca ctgcgtgtgt cctcctggcg agtatgagag gccctactgt 4320 

gaggtgacca ccaggagctt cccgccccag tccttcgtca ccttccgggg cctgagacag 4380 

cgcttccact tcaccatctc cctcacgttt gccactcagg aaaggaacgg cttgcttctc 4440 

tacaacggcc gcttcaatga gaagcacgac ttcatcgccc tggagatcgt ggacgagcag 4 500 

gtgcagctca ccttctctgc aggcgagaca acaacgaccg tggcaccgaa ggttcccagt 4560 

ggtgtgagtg acgggcggtg gcactctgtg caggtgcagt actacaacaa gcccaatatt 4 620 

ggccacctgg gcctgcccca tgggccgtcc ggggaaaaga tggccgtggt gacagtggat 4 680 
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gattgtgaca caaccatggc tgtgcgcttt ggaaaggaca 
gcccagggca ctcagaccgg ctccaagaag tccctggatc 
gggggtgtcc ccaacctgcc agaagacttc ccagtgcaca 
atgcggaacc tgtcagtcga cggcaaaaat gtggacatgg 
ggcacccggg aaggctgcgc tgctcggagg aacttctgcg 
ggaggcacct gtgtcaacag gtggaatatg tatctgtgtg 
gggaagaact gtgagcaagc catgcctcac ccccagctct 
tcctggagtg acctgaacat catcatctct gtgccctggt 
acccggaagg aggacagcgt tctgatggag gccaccagtg 
ctccagatcc tgaacaacta cctccagttt gaggtgtccc 
tccgtgatgc tgtccgggtt gcgggtgacc gacggggagt 
ctgaagaatg ttaaggagga cagtgagatg aagcacctgg 
gggatggacc agaacaaggc agatatcggg ggcatgcttc 
gtggtggtcg gaggcgcctc tgaagacaag gtctccgtgc 
atgcagggag tgaggatggg ggggacgccc accaacgtcg 
gcactcaagg tcagggtgaa ggacggctgt gatgtggacg 
tgtcccccca atagccgctg ccacgacgcc tgggaggact 
gggtaccttg gaataaactg tgtggatgcc tgtcacctga 
gcctgcgtgc gctcccccgg ctccccgcag ggctacgtgt 
tacgggccgt actgtgagaa caaactcgac cttccgtgcc 
cccgtctgtg gaccctgcca ctgtgccgtc agcaaaggct 
accaacggcc agtgccaatg caaggagaat tactacaagc 
ctgccctgcg actgcttccc ccatggctcc cacagccgca 
cagtgtgcct gcaagcccgg cgtcatcggc cgccagtgca 
gccgaggtca ccacgctcgg ctgtgaagtg atctacaatg 
gccggcatct ggtggccaca gaccaagttc gggcagccgg 
ggatccgttg gaaatgcggt ccgacactgc agcggggaga 
ctctttaact gtaccaccat ctccttcgtg gacctcaggg 
cgcaatgaga cgcaggtgga cggcgccagg gccctgcagc 
gctacacagc acacgggcac gctctttggc aatgacgtgc 
ggccacgtcc ttcagcacga gagctggcag cagggcttcg 
gccgactttc acgaggacgt catccactcg ggcagcgccc 
gcggcgtggg agcagatcca gcggagcgag ggcggcacgg 
gagggctact tcagcaacgt ggcacgcaac gtgcggcgga 
atcgtcaccg ccaacatgat tcttgctgtc gacatctttg 
gccagggtcc cgcgattcga caccatccat gaagagttcc 
gtctccttcc cagccgactt cttcagacca cctgaagaaa 
ccggctggcc ggaggaccac cccgcagacc acgcgcccgg 
gccccgatca gcaggcggag gcgacaccct gatgacgctg 
gtcatcattt accgcaccct ggggcagctc ctgcccgagc 
agcctccggt tgcctcaccg gcccatcatt aataccccga 
agcgaggggg ctccgctccc gagacccctg gagaggcccg 
ctggaggtgg aggagcgaac caagcctgtc tgcgtgttct 
ggtgggacgg gagggtggtc tgcccggggc tgcgagctcc 
gtcgcctgcc agtgcagcca cacagccagc tttgcggtgc 
gagaacgggg aggtcctgcc tctgaagatt gtcacctatg 
gcagccctgc tggtggcctt cgtcctcctg agcctggtcc 
cacagcattc acaagcacct cgccgtggcg ctcttcctct 
gggatcaacc agacggaaaa cccgtttctg tgcacagtgg 
atctacatga gcacctttgc ctggaccctc gtggagagcc 
accgaggtgc gcaacatcga cacggggccc atgcggttct 
atcccggcca ttgtcacagg actggcggtc ggcctggacc 
gacttctgct ggctgtcgct tcaagacacc ctgatttgga 
gctgttataa tcatcaacac agtcacttct gtcctatctg 
aagcaccatt attatgggaa aaaagggatc gtctccctgc 
ctgctgctca tcagcgccac ctggctgctg gggctgctgg 
agctttcact acctcttcgc catcttcagc ggcttacagg 
cactgcgtgc tcaaccagga ggtccggaag cacctgaagg 
ctgcacctgg aggactccgc caccaccagg gccaccctgc 
aacaccacct tcggtgacgg gcctgacatg ctgcgcacag 
tcgctggaca gcatcgtcag ggatgaaggg atccagaagc 



tcgggaacta cagctgcgct 
tgaccggccc tctactcctg 
accggcagtt cgtgggctgc 
ccggattcat cgccaacaat 
atgggaggcg gtgtcagaat 
agtgtccact ccgattcggc 
tcagcggtga gagcgtcgtg 
acctggggct catgttccgg 
gtgggcccac cagctttcgc 
acggcccctc cgatgtggag 
ggcaccacct gctgatcgag 
tcaccatgac cttggactat 
ccgggctgac ggtaaggagc 
gccgtggatt ccgaggctgc 
ccaccctgaa catgaacaac 
acccctgtac ctcgagcccc 
acagctgcgt ctgtgacaaa 
acccctgcga gaacatgggg 
gcgagtgtgg gcccagtcac 
ccagaggctg gtgggggaac 
ttgatcccga ctgtaataag 
tcctagccca ggacacctgt 
cttgcgacat ggccaccggg 
accgctgcga caacccgttt 
gctgtcccaa agcatttgag 
ctgcggtgcc atgccctaag 
agggctggct gcccccagag 
ccatgaatga gaagctgagc 
tggtgagggc gctgcgcagt 
gcacggccta ccagctgctg 
acctggcagc cacgcaggac 
tcctggcccc agccaccagg 
cacagctgct ccggcgcctc 
cgtacctgcg gcccttcgtc 
acaagttcaa ctttacggga 
ccagggagct ggagtcctcc 
aagaaggccc cctgctgagg 
ggcctggcac cgagagggag 
gccagttcgc cgtcgctctg 
gctacgaccc cgaccgtcgc 
tggtgagcac gctggtgtac 
tcctggtgga gttcgccctg 
ggaaccactc cctggccgtt 
tgtccaggaa ccggacacat 
tcatggatat ctccaggcgt 
ccgctgtgtc cttgtcactg 
gcatgctgcg ctccaacctg 
ctcagctggt gttcgtgatt 
ttgccatcct cctccactac 
tgcatgtcta ccgcatgctg 
actacgtcgt gggctggggc 
cccagggcta cgggaacccc 
gctttgcggg gcccatcgga 
caaaggtttc ctgccaaaga 
tgaggaccgc attcctcctg 
ctgtgaaccg cgatgcactg 
gccccttcgt cctccttttc 
gcgtgctcgg cgggaggaag 
tgacgcgctc cctcaactgc 
acttgggcga gtccaccgcc 
tcggcgtgtc ctctgggctg 



4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6.540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
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gtgaggggca gccacggaga gccagacgcg tccctcatgc ccaggagctg caaggatccc 8400 

cctggccacg attccgactc agatagcgag ctgtccctgg atgagcagag cagctcttac 8460 

gcctcctcac actcgtcaga cagcgaggac gatggggtgg gagctgagga aaaatgggac 8520 

ccggccaggg gcgccgtcca cagcaccccc aaaggggacg ctgtggccaa ccacgttccg 8580 

gccggctggc ccgaccagag cctggctgag agtgacagtg aggaccccag cggcaagccc 8 640 

cgcctgaagg tggagaccaa ggtcagcgtg gagctgcacc gcgaggagca gggcagtcac 8700 

cgtggagagt accccccgga ccaggagagc gggggcgcag ccaggcttgc tagcagccag 8760 

cccccagagc agaggaaagg catcttgaaa aataaagtca cctacccgcc gccgctgacg 8820 

ctgacggagc agacgctgaa gggccggctc cgggagaagc tggccgactg tgagcagagc 8880 

cccacatcct cgcgcacgtc ttccctgggc tctggcggcc ccgactgcgc catcacagtc 8940 

aagagccctg ggagggagcc ggggcgtgac cacctcaacg gggtggccat gaatgtgcgc 9000 

actgggagcg cccaggccga tggctccgac tctgagaaac cgtgaggcaa gcccgtcacc 9060 

ccacacaggc tgcggcatca ccctcagacc ttggagccca aggggccact gcccttgaag 9120 

tggagtgggc ccagagtgtg gcggtcccca tggtggcagc cccccgactg atcatccaga 9180 

cacaaaggtc ttggttctcc caggagctca gggcctgtca gacctggtga caagtgccaa 9240 

aggccacagg catgagggag gcgtggacca ctgggccagc accgctgagt cctaagactg 9300 

cagtcaaagc cagaactgag aggggacccc agactgggcc cagaggctgg ccagagttca 9360 

ggaacgccgg gcacagacca aagaccgcgg tccagccccg cccaggcggg catctcatgg 9420 

cagtgcggac ccgtggctgg cagcccgggc agtcctttgc aaaggcaccc cttgtcttaa 9480 

aatcacttcg ctatgtggga aaggtggaga tacttttata tatttgtatg ggactctgag 9540 

gaggtgcaac ctgtatatat attgcattcg tgctgacttt gttatcccga gagatccatg 9600 

caatgatctc ttgctgtctt ctctgtcaag attgcacagt tgtacttgaa tctggcatgt 9660 

gttgacgaaa ctggtgcccc agcagatcaa aggtgggaaa tacgtcagca gtggggctaa 9720 

aaccaagcgg ctagaagccc tacagctgcc ttcggccagg aagtgaggat ggtgtgggcc 9780 

ctccccgccg gccccctggg tccccagtgt tcgctgtgtg tgcgtttgtc ctctgctgcc 9840 

atctgccccg gctgtgtgaa ttcaagacag ggcagtgcag cactaggcag gtgtgaggag 9900 

ccctgctgag gtcactgtgg ggcacggttg ccacacggct gtcatttttc acctggtcat 9960 

tctgtgacca ccaccccctc ccctcaccgc ctcccaggtg gcccgggagc tgcaggtggg 10020 

gatggctttg tcctttgctc ctgctccccg tgggacctgg gaccttaaag cgttgcaggt 10080 

tcctgatttg gacagaggtg tggggccttc caggccgtta catacctcct gccaattctc 10140 

taactctctg agactgcgag gatctccagg cagggttctc ccctctggag tctgaccaat 10200 

tacttcattt tgcttcaaat ggccaattgt gcagagggac aaagccacag ccacactctt 10260 

caacggttac caaactgttt ttggaaattc acaccaaggt cgggcccact gcaggcagct 10320 

ggcacagcgt ggcccgaggg gctgtggaac gggtcccgga actgtcagac atgtttgatt 10380 

ttagcgtttc ctttgttctt caaatcaggt gcccaaataa gtgatcagca cagctgcttc 10440 

caaataggag aaaccataaa ataggatgaa aatcaagtaa aatgcaaaga tgtccacact 10500 

gttttaaact tgaccctgat gaaaatgtga gcactgttag cagatgccta tgggagagga 10560 

aaagcgtatc tgaaaatggt ccaggacagg aggatgaaat gagatcccag agtcctcaca 10620 

cctgaatgaa ttatacatgt gccttaccag gtgagtggtc tttcgaagat aaaaaactct 10680 

agtcccttta aacgtttgcc cctggcgttt cctaagtacg aaaaggtttt taagtcttcg 10740 

aacagtctcc tttcatgact ttaacaggat tctgccccct gaggtgtaat ttttttgttc 10800 

tatttttttc cacgtactcc acagccaaca tcacgaggtg taatttttaa tttgatcaga 10860 

actgttacca aaaaacaact gtcagtttta ttgagatggg aaaaatgtaa acctattttt 10920 

attacttaag actttatggg agagattaga cactggaggt ttttaacaga acgtgtattt 10980 

attaatgttc aaaacactgg aattacaaat gagaagagtc tacaataaat taagattttt 11040 

gaatttgtac ttctgcggtg ctggtttttc tccacaaaca cccccgcccc tccccatgcc 11100 

cagggtggcc gtggaaggga cggtttacgg acgtgcagct gagctgtccg tgtcccatgc 11160 

tccctcagcc agtggaacgt gccggaactt tttgtccatt ccctagtagg cctgccacag 11220 

cctagatggg cagtttttgt ctttcaccaa atttgaggac tttttttttt tgccattatt 11280 

tcttcagttt tcttttcttg cactgatctt tctcctctcc ttctgtgact ccagtgactc 11340 
agaogttaga cctcttgatg ttttcccact ggtccctgag gctctgttc 11389 

<210> 238 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_014246 
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<400> 238 

gggagagatt agacactgga ggtttttaac agaacgtgta tttattaatg ttcaaaacac 

<210> 239 
<211> 4372 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM 014314 



<400> 239 

tagttattaa 

cctgctttcc 

cctgcgggga 

gcctgcaagc 

acatggcccc 

agggcccaat 

gctggttccg 

ccattgaaag 

aacgtttaca 

ctgaatgttt 

tgatggcagg 

ccaaaacttt 

tagagaaagg 

acatacagat 

caccttcaga 

tagagcttgc 

gtggaaaaac 

gacaaaaggg 

ctgtattctc 

cagctgagaa 

cacagattct 

tgatgatatt 

ttaattatct 

tgactgcctc 

gcaagctgtg 

aactggagca 

gcgacaaatt 

gaatctgcaa 

aatatgaaca 

aagatgaaga 

ataatgatgc 

aagacttctt 

agagatttga 

atcctaaact 

caataacaat 

aaggaaatcc 

atcagaacac 

gtggagatca 

agtgcaatct 

gaggcagagg 

ttgaaaaaga 

ttcagacatg 

aattcatcag 

tgctctgcag 

aatgccatta 

ccaagccaaa 

actgcagcca 

aaattgaaag 

ggaaggactt 



agttcctatg 
ccgctcgcca 
acgtagctag 
cttccaggat 
ctggtttagg 
ggaggctgcc 
tggctttttg 
ttgggatttc 
accagaattt 
aattaatcag 
tgcagagaaa 
gaaacttgct 
tataaaagat 
tttctaccaa 
agtgtctgat 
tttgcctgct 
ctttgtttca 
gaaagttgtc 
aaaatacttt 
tgtcccagtg 
tgtgaacaac 
tgatgaatgc 
agatcagaaa 
ggttggtgtt 
tgcttctctt 
agttgtttat 
taaatacatc 
agacctcgaa 
atggattgtt 
gagcaggatt 
cctcattatc 
cagcaatgtc 
agaaaagctg 
tgaagacctc 
tctctttgtg 
taaactcagt 
aggaatgacc 
caatattctg 
tgtcatcctt 
aagagcaaga 
acaaataaac 
ggacgaagca 
agatagtcaa 
aaagtgcaaa 
cactgtgctt 
gcagttttca 
tgactgggga 
ttttgtggtg 
tcattttgag 



cagctccgcc 
cgccctcctc 
ctgcaagcag 
tatatccgga 
gaggaagagg 
acactttttc 
gatgccctag 
aaaaaaattg 
aaaaccagaa 
gaatgtgaag 
ttggtggaat 
ttggagaaag 
gttgaaacag 
gaagatccag 
acaaacttgt 
atgaaaggaa 
ctgcttatat 
ttttttgcga 
gaaagacatg 
gaacagattg 
cttaaaaagg 
cacaacacta 
cttggaggat 
ggggatgcca 
gatgcgtcag 
aagccccaga 
atagctcagc 
aacttatctc 
acagttcaga 
tgtaaagccc 
agtgagcatg 
cgagcagcag 
caggaactag 
tgcttcatct 
aaaaccagag 
tttctaaaac 
ctcccggcac 
attgccacct 
tatgagtatg 
ggtagcaagt 
atgtacaaag 
gtatttaggg 
gaaaaaccaa 
gccttggcat 
ggagatgctt 
agttttgaaa 
atccatgtga 
gaggatattg 
aagataccat 



tcgcgtccgg 
ctacccggct 
aggccggcat 
agaccctgga 
tgcagtatat 
tcaagttcct 
accatgcagg 
aaaagttgga 
ttatcccaac 
aaattctaca 
gccttctcag 
aaaggaacaa 
aagatcttga 
aatgccagaa 
acagcccatt 

gtgaacatca 
atcagatccc 
ggtatagagt 
ttgagaacaa 
gaacgattcc 
gtaaacaaca 
cttcaggccc 
aaaacacaga 
tgatagcaac 
agtttttcag 
tgatgaggga 
aaattcaaaa 
aagcatgcat 
tgtttttata 
cacgaatgaa 
gattcgatga 
aaagtgtttc 
tacaagaaga 
cacttgtgga 
ctggcatatt 
agaagtgtat 
cagttgctga 
tgggcaatgt 
gcttccttct 
aaaaaatgat 
aaaagattct 
aacctgtacc 
gttacacagc 
ttaaggaatg 
aaagagcaaa 
agtacaagac 
caactggagt 
ttgatccagc 



cctcatttcc 
ttaaagctag 
gaccaccgag 
ccctacctac 
tcaggctgag 
gttggagctc 
ttattctgga 
ggagtataga 
cgatatcatt 
gatttgctct 
atcagacaag 
gttcagtgaa 
ggataagatg 
tcttagtgag 
taaaccaaga 
aatatgtgct 
tcttaaaaaa 
agtgtatgaa 
tacaggcatt 
tgacatcatc 
atcactatcc 
cccgtacaat 
actgccccag 
tgaagccttg 
agtcaaacac 
gaaagtggaa 
cacagagagt 
tagggaattt 
ggtgttccag 
cacttcacat 
agatgctctg 
gattgagcaa 
cagggatccc 
gtaccactta 
cgctttaaaa 
gactggacgt 
attggatgca 
tgaaggcatt 
catcaaaatg 
gactagtaat 
gaatgactct 
gcatatacag 
tgataaggaa 
tgacgtaaga 
ctttgtgagt 
gatattctgt 
atttgagatt 
tcagacactg 
agaaatgtcc 



tcggaaaatc 
tgaggcacag 
cagcgacgca 
atcctgagct 
aaaaacaaca 
caggaggaag 
ctttatgaag 
ttacttttaa 
tctgatctgt 
actaagggga 
gaaaactggc 
ctgtggattg 
gaaacttctg 
aattcatgtc 
aattaccaat 
cctacaggtt 
ttcccacaag 
cagcagaaat 
tctggagcaa 
attttaactc 
atctttactt 
atgatcatgt 
gtcattgggc 
gattatatct 
aatctggagg 
tcacggatta 
ctggcaaaga 
ggaacacaga 
atgccagaca 
ttgcggaaat 
gattacttga 
gatcttactc 
agcaatgaga 
aacccagaga 
aattggattg 
ggcaaaacaa 
ttcaaagcca 
gacattgcac 
atccaaacca 
gctggtgtaa 
attttacgcc 
actcatgaaa 
aataaaaaac 
gtgatagagg 
agaccacatc 
gcccgacaga 
ccagttataa 
tactcgaagt 
aaatgatatc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
■ 2940 
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aggtcctcaa 
gtgggtataa 
agcagtttgt 
tttgtaggcc 
tctttttaga 
tgaatgaatg 
ggcttttgcc 
tgccaaatcc 
gagcactctt 
ttttattgag 
tactgtacat 
atatgtactc 
cataaagcca 
caatgtgtat 
cttcttgaaa 
ccattgttct 
gtctgttctc 
aaagattaga 
ttcaaagagc 
atcaacggga 
caggcgggga 
gtaaaacaag 
tgtgaaaagt 
taagacccaa 



tcttcagcta 
tcatggatcg 
actatattat 
aacagagctc 
gtcaactgcc 
gaatgttggg 
cttaacaatc 
actgggcact 
cttgaagctc 
tcctcctctt 
atgtatatgt 
atgttcctgg 
tttttatgct 
ccactctaca 
acttagttta 
ctttgaccat 
agatcccttg 
atatgtatcc 
ttagtttgcg 
tgaggtgtta 
gccagagaaa 
gataatactg 
acatgacaca 
tacaaatgga 



cagggaatga 
cttgtacccc 
atatgcaaag 
atagtacttg 
aacaaacaca 
aatttttatc 
tactgaaatt 
tcttaacctt 
tccctgggct 
tgctgatccc 
aactaatata 
tgtagcaacg 
tactactaaa 
gtggtcagct 
ctatacatct 
tttcctataa 
gatgttacag 
aaatggcatt 
gtttcctgga 
cagctgcctc 
cttgccagtt 
aactgtaagg 
gtagttgctt 
ttatttaaac 



gtaactttga 
tgtgaaaata 
cacaaatgag 
ggaaaaatta 
cagtaatcac 
tccctttgtc 
gttcttttga 
ctatttgacc 
tctctctctt 
ttccaagggt 
catacataca 
tgtggtatgg 
agctgtccac 
tttagtagag 
tgccctatta 
tgatgttgat 
atgaggcagt 
cacgtgtcac 
cgtggaaaca 
cctcttcatg 
atataacttc 
gttagtggag 
gataatagtt 
caaaaaaaaa 



gtggagaaga 
tattttttaa 
tgaatcacag 
aaaagcctca 
tctgtacaca 
tccttaacct 
aggttaccag 
tctgcgcatt 
ctagttctat 
tcaatatata 
ggtatgtata 
ctacacagag 
tgtagagttg 
agcataaaaa 
atatgttctc 
gttcaacacc 
ctgactgtcc 
ttagcaaggt 
agtatctgag 
caatctggtg 
tctttggctt 
agtttttaat 
actagtagta 
aaaaaaaaaa 



aacaaacata 
aaatatcttt 
cactgagtat 
tttctagcct 
ctgggataga 
actgtaaact 
tgactctggt 
tggccctgtt 
tctagtcttt 
tacatgtata 
tgtaatggtt 
aacatgagaa 
ctgtatgtag 
tgataaaata 
ttaacgtgtg 
tggactgaat 
tttctacttg 
ttgctgatgc 
ttccctggag 
agcagtggtg 
ttcttcatct 
taaaagaatg 
gtattcttac 
aa 4372 



3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 



<210> 240 
<211> 60 . 
<212> DNA 
<213> Homo sapiens 



<400> 240 

agttcagaca ctgtactcga agtggaagga ctttcatttt gagaagatac catttgatcc 



<210> 241 
<211> 1647 
<212> DNA 
<213> Homo sapiens 



<400> 241 

gcgcgcgggt 

atggggtcgg 

ctgaggaaag 

cgcaccacgg 

tgccccttgg 

agctgtctta 

ctagctgtac 

tatgagtcca 

acttctgctg 

atggtagcca 

aagattggac 

aagaagatag 

aaaccacaga 

gaaaatgctg 

acattccaaa 



ttcgttgacc 
agctgatcgg 
cagaggagta 
agaccagcag 
acagggctta 
aatcttttga 
agtttagctg 
gtcttcccca 
cactgctttc 
catccggtgt 
agcaggtcga 
tggttgaagc 
aagatgaaga 
ccagtgctca 
ctgatagtac 



cgcggcgttc 
gcgcctagcc 
cttgcgcctg 
tgcagtcatg 
tttaattaaa 
gtgtttactg 
tatagaagca 
gacacagcaa 
agcatgcaag 
aaaaaaagct 
cagagaacct 
cccagcaaag 
tctgacacag 
aaaggctaca 
attgccatct 



acgggaattg 
ccgcgcctgg 
tcccgggtga 
tgcctggacc 
ctttctggtt 
ggcctgaatt 
gtgaacatgg 
gtggatcttg 
attctaaagc 
atatttgatc 
ggagatgtag 
gaaatggaga 
gattatgaag 
gcagagtgat 
ccaggaagac 



ttcgctttag 
gcctcgccga 
agtgtgtcgg 
ttgcagcttc 
tgaacaagga 
caaatattgg 
cttcaaagat 
acttatccag 
tgaaagtgga 
gactgtgtaa 
ctactccacc 
aggtagagga 
aatggaaaag 
ttcagcttcc 
ttgacggctt 



tgccggcgcc 
gcccgacatg 
cctctccgca 
ctggatgaag 
gacatatcag 
aataagagac 
actaaaaagc 
gccacttttc 
taaaaacaaa 
acaactagag 
acggaagaga 
gatgccacat 
aaaaattttg 
aaactggtat 
tgggattttg 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
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tttaaacttt tataataagg atcctaagac tgttgccttt aaatagcaaa gcagcctacc • 960 

tggaggctaa gtctgggcag tgggctggcc cctggtgtga gcattagacc agccacagtg 1020 

cctgattggt atagccttat gtgctttcct acaaaatgga attggaggcc gggcgcagtg 1080 

gctcacgcct gtaatcccag cactttggga ggccaaggtg ggtggatcac ctgaggtcag 1140 

gagctcgaga ccagcctggc caacatggtg aaaccccatc tctactaaaa atacaaaaat 1200 

tagccaggtg tgatggtgca tgcctgtaat cccagctcct cagtaggctg agacaggagc 1260 

atcacttgaa cgtgggaggc agaggttgca gtgagccgag attgcaccac cgcactccag 1320 

cctgggtgac agagcgagac ttatctcata aataaataga tagatactcc agcctgggtg 1380 

acagagcgag acttatagat agatagatag atagatggat agatagatag atagatagat 1440 

agatagataa acggaattgg agccattttg ctttaagtga atggcagtcc cttgtcttat 1500 

tcagaatata aaattcagtc tgaatggcat cttacagatt ttacttcaat ttttgtgtac 1560 

ggtatttttt atttgactaa atcaatatat tgtacagcct aagttaataa atgttattta 1620 
tatatgcaaa aaaaaaaaaa aaaaaaa 1647 

<210> 242 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NMJD14321 
<400> 242 

tgctttaagt gaatggcagt cccttgtctt attcagaata taaaattcag tctgaatggc 60 

<210> 243 
<211> 1455 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_014364 
<400> 243 

ggcggtccgc acgcacctcg gtaacatcac agcaggtcca ggccaatgat aaccttataa 60 

gaggccatgt cgaagcgcga catcgtcctc accaatgtca ccgttgtcca gttgctgcga 120 

cagccgtgcc cggtgaccag agcaccgccc ccacctgagc ctaaggctga agtagagccc 180 

cagccacaac cagagcccac accagtcagg gaggaaataa agccaccacc gccaccactg 240 

cctcctcacc ccgctactcc tcctcctaag atggtgtctg tggcccggga gctgactgtg 300 

ggcatcaatg gatttggacg catcggtcgc ctggtcctgc gcgcctgcat ggagaagggt 360 

gttaaggtgg tggctgtgaa. tgatccattc attgacccgg aatacatggt gtacatgttt 420 

aagtatgact ccacccacgg ccgatacaag ggaagtgtgg aattcaggaa tggacaactg 480 

gtcgtggaca accatgagat ctctgtctac cagtgcaaag agcccaaaca gatcccctgg 540 

agggctgtcg ggagccccta cgtggtggag tccacaggcg tgtacctctc catacaggca 600 

gcttcggacc acatctctgc aggtgctcaa cgtgtggtca tctccgcgcc ctcaccggat 660- 

gcaccaatgt tcgtcatggg tgtcaatgaa aatgactata accctggctc catgaacatt 720 

gtgagcaacg cgtcctgcac caccaactgt ttggctcccc tcgccaaagt catccacgag 780 

cgatttggga tcgtggaagg gttgatgacc acagtccatt cctacacggc cacccagaag 84 0 

acagtggacg ggccatcaag gaaggcctgg cgagatgggc ggggtgccca ccagaacatc 900 

atcccagcct ccactggggc tgcgaaagct gtgaccaaag tcatcccaga gctcaaaggg 960 

aagctgacag ggatggcgtt ccgggtacca accccggatg tgtctgtcgt ggacctgacc 1020 

tgccgcctcg cccagcctgc cccctactca gccatcaagg aggctgtaaa agcagcagcc 1080 

aaggggccca tggctggcat ccttgcctac accgaggatg aggtcgtctc tacggacttc 114 0 

ctcggtgata cccactcgtc catcttcgat gctaaggccg gcattgcgct caatgacaat 1200 

ttcgtgaagc tcatttcatg gtacgacaac gaatatggct acagtcaccg ggtggtcgac 1260 

ctcctccgct acatgttcag ccgagacaag tgaaacggga aggtcctttc tttccttccc 1320 

aggggccggg gccggaacat gtgcctcccg ttccagcatc tggctgcccg ggggaggaag 1380 

gacacccggg gcgggcgccc cacgccgatg ggtccatggt gaaataaaaa acagtgctcg 14 40 
aaaaaaaaaa aaaaa 1455 
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<210> 244 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NMJD14364 
<400> 244 

cgctcaatga caatttcgtg aagctcattt catggtacga caacgaatat ggctacagtc 60 

<210> 245 
<211> 935 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_014462 
<400> 245 

gaagtgggta agggtaatat ggaggagctt ccggcaggcc ccggcggctg aaagccgggg 60 

cagaagtgct ggtctcggtc gggattccgg gcttggtccc accgaggcgg cgactgcggt 120 

aggagggaag aggttttgga cgcgctggcc tcccgccgct gtgcattgca gcattatttc 180 

agttcaaaat gaactatatg cctggcaccg ccagcctcat cgaggacatt gacaaaaagc 24 0 

acttggttct gcttcgagat ggaaggacac ttataggctt tttaagaagc attgatcaat 300 

ttgcaaactt agtgctacat cagactgtgg agcgtattca tgtgggcaaa aaatacggtg 360 

atattcctcg agggattttt gtggtcagag gagaaaatgt ggtcctacta ggagaaatag 420 

acttggaaaa ggagagtgac acacccctcc agcaagtatc cattgaagaa attctagaag 480 

aacaaagggt ggaacagcag accaagctgg aagcagagaa gttgaaagtg caggccctga 54 0 

aggaccgagg tctttccatt cctcgagcag atactcttga tgagtactaa tcttttgccc 600 

agaggctgtt ggctcttgaa gagtaggggc tgtcactgag tgaaagtgac atcctggcca 660 

cctcacgcat ttgatcacag actgtagagt tttgaaaagt cacttttatt tttaattatt 720 

ttacatatgc aacatgaaga aatcgtgtag gtgggttttt tttttaataa caaaatcact 780 

gtttaaagaa acagtggcat agactccttc acacatcact gtggcaccag caactacttc 840 

tttatattgt tcttcatatc ccaaattaga gtttacaggg acagtcttca tttacttgta 900 

aataaaatat gaatctcaaa aaaaaaaaaa aaaaa 935 

<210> 246 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_014 4 62 
<400> 246 

ttaataacaa aatcactgtt taaagaaaca gtggcataga ctccttcaca catcactgtg 60 

<210> 247 
<211> 890 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_014501 



ggcggaccga agaacgcagg aagggggccg gggggacccg cccccggccg gccgcagcca 60 

tgaactccaa cgtggagaac ctacccccgc acatcatccg cctggtgtac aaggaggtga 120 

cgacactgac cgcagaccca cccgatggca tcaaggtctt tcccaacgag gaggacctca 180 

ccgacctcca ggtcaccatc gagggccctg aggggacccc atatgctgga ggtctgttcc 240 



<400> 247 
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gcatgaaact cctgctgggg aaggacttcc ctgcctcccc acccaagggc tacttcctga 300 

ccaagatctt ccacccgaac gtgggcgcca atggcgagat ctgcgtcaac gtgctcaaga 360 

gggactggac ggctgagctg ggcatccgac acgtactgct gaccatcaag tgcctgctga 420 

tccaccctaa ccccgagtct gcactcaacg aggaggcggg ccgcctgctc ttggagaact 480 

acgaggagta tgcggctcgg gcccgtctgc tcacagagat ccacgggggc gccggcgggc 540 

ccagcggcag ggccgaagcc ggtcgggccc tggccagtgg cactgaagct tcctccaccg 600 

accctggggc cccagggggc ccgggagggg ctgagggtcc catggccaag aagcatgctg 660 

gcgagcgcga taagaagctg gcggccaaga aaaagacgga caagaagcgg gcgctgcggg 720 

cgctgcggcg gctgtagtgg gctctcttcc tccttccacc gtgaccccaa cctctcctgt 780 

cccctccctc caactctgtc tctaagttat ttaaattatg gctggggtcg gggagggtac 840 
agggggcact gggacctgga tttgtttttc taaataaagt tggaaaagca 8 90 

<210> 248 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_014501 
<400> 248 

acacgtactg ctgaccatca agtgcctgct gatccaccct aaccccgagt ctgcactcaa 60 

<210> 249 
<211> 1182 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_016095 
<400> 249 

gcggccggcg gcgtctcctc ccgggacgct gaggggcccg aggagaccgt gaggctctgg 60 

cctgcagctc gcgccgccat ggacgctgcc gaggtcgaat tcctcgccga gaaggagctg 120 

gttaccatta tccccaactt cagtctggac aagatctacc tcatcggggg ggacctgggg 180 

ccttttaacc ctggtttacc cgtggaagtg cccctgtggc tggcgattaa cctgaaacaa 240 

agacagaaat gtcgcctgct ccctccagag tggatggatg tagaaaagtt ggagaagatg 300 

agggatcatg aacgaaagga agaaactttt accccaatgc ccagccctta ctacatggaa 360 

cttacgaagc tcctgttaaa tcatgcttca gacaacatcc cgaaggcaga cgaaatccgg 420 

accctggtca aggatatgtg ggacactcgt atagccaaac tccgagtgtc tgctgacagc 480 

tttgtgagac agcaggaggc acatgccaag ctggataact tgaccttgat ggagatcaac 540 

accagcggga ctttcctcac acaagcgctc aaccacatgt acaaactccg cacgaacctc 600 

cagcctctgg agagtactca gtctcaggac ttctagagaa aggcctggtg caggcggctt 660 

gctgggggat gtgagcgctc aggatgtgat gaggtactcg tggttctgga gctctagaaa 720 

cacttctgat gcatgaaaaa tgtgtgatgg tgcaaggaat ggattcagga tgttgttgga 780 

gaaacaagtt tgtgattagt ccttaaaact tagctccctg ggacattctt caattccaca 840 

tctgtttcta gaaaccagcc ctttttcccc ccacttttga gaaataaaaa agccttaggt 900 

aaataagtca ttctccctag cagagccact tgggtctcct gcatggaagc cgtcacactt 960 

gggcaggtgt tcagtgactg gtaggtgtag atacagcagg agtggccatg tggtccacgg 1020 

ctttttaccc cttcttgatc ctgatttctt gggctgaatt tagactctct cacagaggtg 1080 

gctcacagag aaggatggca gatggtgcag ccaacaatgc tgaccggtgc ttatcctcta 1140 
agccctgatc cacaataaaa atggacccaa ctcaaaaaaa aa 1182 

<210> 250 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_016095 
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<400> 250 

atggattcag gatgttgttg gagaaacaag tttgtgatta gtccttaaaa cttagctccc 60 

<210> 251 
<211> 704 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_016185 
<400> 251 

tgcagcggtg gtcggctgtt gggtgtggag tttcccagcg cccctcgggt ccgacccttt 60 

gagcgttctg ctccggcgcc agcctacctc gctcctcggc gccatgacca caaccaccac 120 

cttcaaggga gtcgacccca acagcaggaa tagctcccga gttttgcggc ctccaggtgg 180 

tggatccaat ttttcattag gttttgatga accaacagaa caacctgtga ggaagaacaa 240 

aatggcctct aatatctttg ggacacctga agaaaatcaa gcttcttggg ccaagtcagc 300 

aggtgccaag tctagtggtg gcagggaaga cttggagtca tctggactgc agagaaggaa 360 

ctcctctgaa gcaagctccg gagacttctt agatctgaag ggagaaggtg atattcatga 420 

aaatgtggac acagacttgc caggcagcct ggggcagagt gaagagaagc ccgtgcctgc 480 

tgcgcctgtg cccagcccgg tggccccggc cccagtgcca tccagaagaa atccccctgg 540 

cggcaagtcc agcctcgtct tgggttagct ctgactgtcc tgaacgctgt cgttctgtct 600 

gtttcctcca tgcttgagaa ctgcacaact tgagcctgac tgtacatctt cttggatttg 660 
tttcattaaa aagaagcact ttatgtaaaa aaaaaaaaaa aaaa 704 

<210> 252 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_016185 



<400> 252 

tgaaccaaca gaacaacctg tgaggaagaa caaaatggcc tctaatatct ttgggacacc 

<210> 253 
<211> 2268 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> Modified_base 
<222> 1 . . . 2268 
<223> n = a,c,g, or t 

<300> 



<308> NM_016359 



<400> 253 

gggatttgaa ccncgctgac gaagtttggt 

gaatcgcgat gatcatcccc tctctagagg 

agaacttagc caagagtctg ggtctccggg 

ccttgaaagg ctacattaaa catgaggcaa 

aaacttctgc atcctcttgt gatgagactg 

agagacagcc acttggccat gtcaccaaaa 

accctgactc acagcagaat cattcagaga 

atcatgaaaa gcaggaaagc caggatctca 

acgagcacca agaagctgag aatgctgttt 



gatccatctt ccgagtatog ccgggatttc 60 

agctggactc cctcaagtac agtgacctgc 120 

ccaacctgag ggcaaccaag ttgttaaaag 180 

gaaaaggaaa tgagaatcag gatgaaagtc 240 

agatacagat cagcaaccag gaagaagctg 300 

caaggagaag gtgcaagact gtccgtgtgg 360 

taaaaataag taatcccact gaattccaga 420 

gagctactgc aaaagttcct tctccaccag 480 

cctcaggtaa cagagattca aaggtacctt 540 
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cagaaggaaa gaaatctctc tacacagatg agtcatccaa acctggaaaa aataaaagaa 600 

ctgcaatcac tactccaaac tttaagaagc ttcatgaagc tcattttaag gaaatggagt 660 

ccattgatca atatattgag agaaaaagaa acattttgaa gaacacaatt ccatgaatga 720 

actgaagcag cagcccatca ataagggagg ggtcaggact ccagtacctc caagaggaag 780 

actctctgtg gcttctactc ccatcagcca acgacgctcg caaggccggt cttgtggccc 840 

tgcaagtcag agtaccttgg gtctgaaggg gtcactcaag cgctctgcta tctctgcagc 900 

taaaacgggt gtcaggtttt cagctgctac taaagataat gagcataagc gttcactgac 960 

caagactcca gccagaaagt ctgcacatgt gaccgtgtct gggggcaccc caaaaggcga 1020 

ggctgtgctt gggacacaca aattaaagac catcacgggg aattctgctg ctgttattac 1080 

cccattcaag ttgacaactg aggcaacgca gactccagtc tccaataaga aaccagtgtt 1140 

tgatcttaaa gcaagtttgt ctcgtcccct caactatgaa ccacacaaag gaaagctaaa 1200 

accatggggg caatctaaag aaaataatta tctaaatcaa catgtcaaca gaattaactt 1260 

ctacaagaaa acttacaaac aaccccatct ccagacaaag gaagagcaac ggaagaaacg 1320 

cgagcaagaa cgaaaggaga agaaagcaaa ggttttggga atgcgaaggg gcctcatttt 1380 

ggctgaagat taataatttt ttaatatctt gtaaatattc ctgtattctc aacttttttc 1440 

cttttgtaaa tttttttttt tttgctgtca tccccacttt agtcacgaga tctttttctg 1500 

ctaactgttc atagtctgtg tagtgtccat gggttcttca tgtgctatga tctctgaaaa 1560 

gacgttatca ccttaaagct caaattcttt gggatggttt ttacttaagt ccattaacaa 1620 

ttcaggtttc taacgagacc catcctaaaa ttctgtttct agatttttaa tgtcaagttc 1680 

ccaagttccc cctgctggtt ctaatattaa cagaactgca gtcttctgct agccaatagc 1740 

atttacctga tggcagctag ttatgcaagc ttcaggagaa tttgaacaat aacaagaata 1800 

gggtaagctg ggatagaaag gccacctctt cactctctat agaatatagt aacctttatg 1860 

aaacggggcc atatagtttg gttatgacat caatatttta cctaggtgaa attgtttagg 1920 

cttatgtacc ttcgttcaaa tatcctcatg taattgccat ctgtcactca ctatattcac 1980 

aaaaataaaa ctctacaact cattctaaca ttgcttactt aaaagctaca tagccctatc 2040 

gaaatgcgag gattaatgct ttaatgcttt tagagacagg gtctcactgt gttgcccagg 2100 

ctggtctcaa actccaccaa atgtacttct tattcatttt atggaaaaga ctaggctttg 2160 

cttagtatca tgtocatgtt tccttcacct cagtggagct tctgagtttt atactgctca 2220 
agatcgtcat aaataaaatt ttttctcatt gtcaaaaaaa aaaaaaaa 2268 

<210> 254 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_016359 
<400> 254 

acattgctta cttaaaagct acatagccct atcgaaatgc gaggattaat gctttaatgc 60 

<210> 255 
<211> 1590 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_016816 
<400> 255 

gaggcagttc tgttgccact ctctctcctg tcaatgatgg atctcagaaa taccccagcc 60 

aaatctctgg acaagttcat tgaagactat ctcttgccag acacgtgttt ccgcatgcaa 120 

atcgaccatg ccattgacat catctgtggg ttcctgaagg aaaggtgctt ccgaggtagc 180 

tcctaccctg tgtgtgtgtc caaggtggta aagggtggct cctcaggcaa gggcaccacc 240 

ctcagaggcc gatctgacgc tgacctggtt gtcttcctca gtcctctcac cacttttcag 300 

gatcagttaa atcgccgggg agagttcatc caggaaatta ggagacagct ggaagcctgt 360 

caaagagaga gagcactttc cgtgaagttt gaggtccagg ctccacgctg gggcaacccc 420 

cgtgcgctca gcttcgtact gagttcgctc cagctcgggg agggggtgga gttcgatgtg 480 

ctgcctgcct ttgatgccct gggtcagttg actggcagct ataaacctaa cccccaaatc 540 

tatgtcaagc tcatcgagga gtgcaccgac ctgcagaaag agggcgagtt ctccacctgc 600 

ttcacagaac tacagagaga cttcctgaag cagcgcccca ccaagctcaa gagcctcatc 660 
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cgcctagtca agcactggta ccaaaattgt 
tatgccctgg agctcctgac ggtctatgct 
aacacagccc aaggatttcg gacggtcttg 
atctactgga caaagtatta tgactttaaa 
cagctcacga aacccaggcc tgtgatcctg 
ggtggagacc caaagggttg gaggcagctg 
ccatgcttta agaattggga tgggtcccca 
aacagtacag acgatgagac cgacgatccc 
acacatgagt accctcattt ctctcataga 
caggcagaag aggactggac ctgcaccatc 
ggctccagtg ttatctggac cagttccttc 
gacaaagctc ctcagtgagc tggtgtataa 
tggccttcta tgccctctat cctatcatag 
acctattctc tgaaaatatt ccctgagaga 
ttccagcctt gactttcttc tgtgcacctg 
taacaataaa aataaagcaa ataccaaaaa 




PCT/LS2006/004280 



aagaagaagc ttgggaagct gccacctcag 720 

tgggagcgag ggagcatgaa aacacatttc 780 

gaattagtca taaactacca gcaactctgc 840 

aaccccatta ttgaaaagta cctgagaagg 900 

gacccggcgg accctacagg aaacttgggt 960 

gcacaagagg ctgaggcctg gctgaattac 1020 

gtgagctcct ggattctgct ggctgaaagc 1080 

aggacgtatc agaaatatgg ttacattgga 1140 

cccagcacgc tccaggcagc atccacccca 1200 

ctctgaatgc cagtgcatct tgggggaaag 1260 

attttcaggt gggactcttg atccagagaa 1320 

tccaagacag aacccaagtc tcctgactcc 1380 

ataacattct ccacagcctc acttcattcc 1440 

gaacagagag atttagataa gagaatgaaa 1500 

atgggagggt aatgtctaat gtattatcaa 1560 
1590 



<210> 256 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_016816 



<400> 256 

cgatcccagg acgtatcaga aatatggtta cattggaaca catgagtacc ctcatttctc 

<210> 257 
<211> 2905 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> NM_016817 



<400> 257 

cggcagccag ctgagagcaa tgggaaatgg 
gaagctgggt tggtttatcc aggaatacct 
cgacgagatg gtgaacacca tctgtgacgt 
gcagggagtg gccataggtg gctcctatgg 
tggtaccctt gtccttttct tcagtgactt 
acgtgacatc ctcgataaaa ctggggataa 
gaaaaacaat ttcgagatcc agaagtccct 
aaatcagaga atctctttcg aggtgctggc 
tcccagcccc tggatctatc gagagctcaa 
tggtgagttt gcagtctgct tcactgaact 
aaaactaaag gatttgatcc tcttgataaa 
caaggattta ccctcgctgt ctccgtatgc 
acaggggtgc agaaaagaca actttgacat 
gatcaaatgc caggagaagc tgtgtatcta 
gaccatcagg aacatcctgc tgcaccagct 
agttgaccca accaataatg tgagtggaga 
agctcaaacc tggttgactt ctcccaacct 
tgtcctgcct gcaccactct tcacgacccc 
gtttctccag cccaacaaat gcttcctaga 

tacattcctt aaagaaaact gcttccgaca 
aggatcaacc gccaaaggca cagctctgaa 
ccataactca cttaaaagct acacctccca 
aatccatgaa cagctgaaag ccttttggag 



ggagtcccag ctgtcctcgg tgcctgctca 60 

gaagccctac gaagaatgtc agacactgat 120 

ctgcaggaac cccgaacagt tccccctggt ■ 180 

acggaaaaca gtcttaagag gcaactccga 240 

aaaacaattc caggatcaga agagaagcca 300 

gctgaagttc tgtctgttca cgaagtggtt 360 

tgatgggtcc accatccagg tgttcacaaa 420 

cgccttcaac gctctgagct taaatgataa 480 

aagatccttg gataagacaa atgccagtcc 54 0 

ccagcagaag ttttttgaca accgtcctgg 600 

gcactggcat caacagtgcc agaaaaaaat 660 

cctggagctg cttacggtgt atgcctggga 720 

tgctgaaggc gtcagaacgg ttctggagct 780 

ttggatggtc aactacaact ttgaagatga 840 

ccaatcagcg aggccagtaa tcttggatcc 900 

taaaatatgc tggcaatggc tgaaaaaaga 960 

ggataatgag ttacctgcac catcttggaa 1020 

aggccacctt ctggataagt tcatcaagga 1080 

gcagattgac agtgctgtta acatcatccg 1140 

atcaacagcc aagatccaga ttgtccgggg 1200 

gactggctct gatgccgatc tcgtcgtgtt 1260 

aaaaaacgag cggcacaaaa tcgtcaagga- 1320 

ggagaaggag gaggagcttg aagtcagctt 1380 
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tgagcctccc 


aagtggaagg 


ctcccagggt 


gctgagcttc 


tctctgaaat 


ccaaagtcct 


1440 


caacgaaagt 


gtcagctttg 


atgtgcttcc 


tgcctttaat 


gcactgggtc 


agctgagttc 


1500 


tggctccaca 


cccagccccg 


aggtttatgc 


agggctcatt 


gatctgtata 


aatcctcgga 


1560 


cctcccggga 


ggagagtttt 


ctacctgttt 


cacagtcctg 


cagcgaaact 


tcattcgctc 




ccggcccacc 


aaactaaagg 


atttaattcg 


cctggtgaag 


cactggtaca 


aagagtgtga 


1680 


aaggaaactg 


aagccaaagg 


ggtctttgcc 


cccaaagtat 




gc caeca 


1740 


ctatgcctgg 


gagcagggga 


gtggagtgcc 


ggattttgac 


actacaaaaa 




1800 


agtcctggag 


ctggtcacac 


aatatcagca 


gctcggcatc 


ttctqqaaq 
yy yy 


tcaattaca 


1860 


ctttgaagat 


gagaccgtga 


ggaagtttct 






ccaggcc g 




gatcttggac 


ccaggcgaac 


ccacaggtga 


cataaataaa 
^y '-y y y ^-yy a 


tt 

gygg g 


gg g gg 




tcttctggac 


aaagaagcaa 


aggttaggtt 




gc caagg 


atgggactgg 




aaacccaata 


ccaccttgga 








gtggagctag 




gatccatcct 


attgtcaatg 


agatgttctc 


atccaaa 303 


cata^aatcc 
g 


tgaataataa 


?i fin 


ttctaaaaga 


aacttctgga 


gatcatctgg 


caatcgcttt 




gc caccg g 




agaaagagtc 


actcacatcc 


attcttccct 


tgatggtccc 


tattcctcct 






tcttggactt 


cttgaaatca 


atcaagactg 


caaacccfctt 


cataaagctg 


ecttgetgaa 




ctcctctctg 


caggagccct 


gcttaaaata 




tcactttatg 


tgcatcttat 


2400 


ttctgtcaac 


ttgtattttt 


ttttcttgta 


tttttccaat 


tagctcctcc 


tttttccttc 


2460 


cagtctaaaa 


aaggaatcct 


ctgtgtcttc 


aaagcaaagc 


tctttacttt 


ccccttggtt 


2520 


tGtot a t° tC 


tgtgatcttg 


ctctcggtgc 


ttccaactca 


tccacgtcct 


gtctgtttcc 


2580 






tgcccctgct 


gacacagaca 


tcctctatgc 


cagcagccag 


2640 


gccaaccctt 


tcattagaac 


ttcaagctct 


ccaaaggctc 


agattataac 


tgttgtcata 


2700 


tttatatgag 


gctgttgtct 


tttccttctg 


agcctgcctt 


tatcccccca 


cccaggagta 


2760 


tcctcttgcc 


aaagcaaaag 


actttttcct 


tggctttagc 


cttaaagata 


cttgaaggtc 


2820 


taggtgcttt 


aacctcacat 


accctcactt 


aaacttttat 


cactgttgca 


tataccagtt 


2880 


gtgatacaat 


aaagaatgta 


tctgg 2905 




<210> 258 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_01G817 












<400> 258 














aaggtctagg tgctttaacc 


tcacataccc 


tcacttaaac 


ttttatcact 


gttgeatata 


60 


<210> 259 














<211> 2054 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_017414 












<400> 259 














gggaagctcg 


ggccggcagg 


gtttccccgc 


acgctggcgc 


ccagc cccg 


gegeggagge 




cgctgtaagt 


ttcgctttcc 


attcagtgga 


aaacgaaagc 


tgggcggggt 


gccacgagcg 


i o! 


cggggccaga 


ccaaggcggg 


cccggagcgg 


aacttcggtc 


ccagctcggt 


ccccggctca 




gtcccgacgt 


ggaactcagc 


agcggaggct 


ggacgcttgc 




agagattcca 


o^n 


tcgtgcctgg 


ctcacataag 


cgcttcctgg 


aagtgaagtc 


gtgctgtcct 


gaacgeggge 


300 


caggcagctg 


cggcctgggg 


gttttggagt 


gatcacgaat 


gagcaaggcg 


tttgggctcc 


360 


tgaggcaaat 


ctgtcagtcc 


atcctggctg 


agtcctcgca 


gtccccggca 


gatcttgaag 


420 


aaaagaagga 


agaagacagc 


aacatgaaga 


gagagcagcc 


cagagagcgt 


cccagggcct 


480 


gggactaccc 


tcatggcctg 


gttggtttac 


acaacattgg 


acagacctgc 


tgecttaact 


540 


ccttgattca 


ggtgttcgta 


atgaatgtgg 


acttcaccag 


gatattgaag 


aggatcaegg 


600 


tgcccagggg 


agctgacgag 


cagaggagaa 


gcgtcccttt 


ccagatgctt 


ctgctgctgg 


660 


agaagatgca 


ggacagccgg 


cagaaagcag 


tgcggcccct 


ggagctggcc 


tactgcctgc 


720 


agaagtgcaa 


cgtgcccttg 


tttgtccaac 


atgatgctgc 


ccaactgtac 


ctcaaactct 


780 


ggaacctgat 


taaggaccag 


atcactgatg 


tgcacttggt 


ggagagactg 


caggccctgt 


840 
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atacgatccg 


ggtgaaggac 


tccttgattt 


gcgttgactg 


tgccatggag 


agtagcagaa 


900 


acagcagcat 


gctcaccctc 


ccactttctc 


tttttgatgt 


ggactcaaag 


cccctgaaga 


960 


cactggagga 


cgccctgcac 


tgcttcttcc 


agcccaggga 


gttatcaagc 


aaaagcaagt 


1020 


gcttctgtga 


gaactgtggg 


aagaagaccc 


gtgggaaaca 


ggtcttgaag 


ctgacccatt 


1080 


tgccccagac 


cctgacaatc 


cacctcatgc 


gattctccat 


caggaattca 


cagacgagaa 


1140 


agatctgcca 


ctccctgtac 


ttcccccaga 


gcttggattt 


cagccagatc 


cttccaatga 


1200 


agcgagagtc 


ttgtgatgct 


gaggagcagt 


ctggagggca 


gtatgagctt 


tttgctgtga 


1260 


ttgcgcacgt 


gggaatggca 


gactccggtc 


attactgtgt 


ctacatccgg 


aatgctgtgg 


1320 


atggaaaatg 


gttctgcttc 


aatgactcca 


atatttgctt 


ggtgtcctgg 


gaagacatcc 


1380 


agtgtaccta 


cggaaatcct 


aactaccact 


ggcaggaaac 


tgcatatctt 


ctggtttaca 


1440 


tgaagatgga 


gtgctaatgg 


aaatgcccaa 


aaccttcaga 


gattgacacg 


ctgtcatttt 


1500 


ccatttccgt 


tcctggatct 


acggagtctt 


ctaagagatt 


ttgcaatgag 


gagaagcatt 


1560 


gttttcaaac 


tatataactg 


agccttattt 


ataattaggg 


atattatcaa 


aatatgtaac 


1620 


catgaggccc 


ctcaggtcct 


gatcagtcag 


aatggatgct 


ttcaccagca 


gacccggcca 


1680 


tgtggctgct 


cggtcctggg 


tgctcgctgc 


tgtgcaagac 


attagccctt 


tagttatgag 


1740 


cctgtgggaa 


cttcaggggt 


tcccagtggg 


gagagcagtg 


gcagtgggag 


gcatctgggg 


1800 


gccaaaggtc 


agtggcaggg 


ggtatttcag 


tattatacaa 


ctgctgtgac 


cagacttgta 


1860 


tactggctga 


atatcagtgc 


tgtttgtaat 


ttttcacttt 


gagaaccaac 


attaattcca 


1920 


tatgaatcaa 


gtgttttgta 


actgctattc 


atttattcag 


caaatattta 


ttgatcatct 


1980 


cttctccata 


agatagtgtg 


ataaacacag 


tcatgaataa 


agttattttc 


cacaaaaaaa 


2040 


aaaaaaaaaa 


aaaa 2054 












<210> 260 














<211> 60 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_017414 












<400> 260 














tgagcatctc ttctccataa 


gatagtgtga 


taaacacggt 


catgaataaa 


gttattttcc 


60 


<210> 261 














<211> 3638 














<212> DNA 














<213> Homo 


sapiens 












<300> 














<308> NM_017523 













ggtagatgcg gctgtgacag cagcaaagaa tgacggccaa gggcgacagc aggggctggc 60 

catgctgtaa aggggcttct tgggagggtc cagcctcagg aatcaagggg aactcctgag 120 

ccgagaattc tgaagatctc ctccctccct gaagctgtgg gctgggccat cggaaaactt 180 

tcagttttgt ttccttgcct gcaagaaacg aaactcaacc gaaagcctgc agagagcaga 240 

acatggaagg agacttctcg gtgtgcagga actgtaaaag acatgtagtc tctgccaact 300 

tcaccctcca tgaggcttac tgcctgcggt tcctggtcct gtgtccggag tgtgaggagc 360 

ctgtccccaa ggaaaccatg gaggagcact gcaagcttga gcaccagcag gttgggtgta 4 20 

cgatgtgtca gcagagcatg cagaagtcct cgctggagtt tcataaggcc aatgagtgcc 4 80 

aggagcgccc tgttgagtgt aagttctgca aactggacat gcagctcagc aagctggagc 540 

tccacgagtc ctactgtggc agccggacag agctctgcca aggctgtggc cagttcatca 600 

tgcaccgcat gctcgcccag cacagagatg tctgtcgcag tgaacaggcc cagctcggga 660 

aaggggaaag aatttcagct cctgaaaggg aaatctactg tcattattgc aaccaaatga 720 

ttccagaaaa taagtatttc caccatatgg gtaaatgttg tccagactca gagtttaaga 780 

aacactttcc tgttggaaat ccagaaattc ttccttcatc tcttccaagt caagctgctg 840 

aaaatcaaac ttccacgatg gagaaagatg ttcgtccaaa gacaagaagt ataaacagat 900 

ttcctcttca ttctgaaagt tcatcaaaga aagcaccaag aagcaaaaac aaaaccttgg 960 

atccactttt gatgtcagag cccaagccca ggaccagctc ccctagagga gataaagcag 1020 

cctatgacat tctgaggaga tgttctcagt gtggcatcct gcttcccctg ccgatcctaa 1080 

atcaacatca ggagaaatgc cggtggttag cttcatcaaa aggaaaacaa gtgagaaatt 1140 



<400> 261 
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tcagctagat ttggaaaagg aaaggtacta caaattcaaa agatttcact tttaacactg 1200 

gcattcctgc ctacttgctg tggtggtctt gtgaaaggtg atgggtttta ttcgttgggc 1260 

tttaaaagaa aaggtttggc agaactaaaa acaaaactca cgtatcatct caatagatac 1320 

agaaaaggct tttgataaaa ttcaacttga cttcatgtta aaaaccctca acaaaccagg 1380 

cgtcgaagga acatacctca aaataataag agccatctat gacaaaacca cagccaacat 1440 

catactgaat gagcaaaagc tggagcatta ctcttgagaa gtagaacaag gcacttcagt 1500 

cctattcaac atagtactgg aagtcctcgc cacagcaatc aggcaagaga aagaaataaa 1560 

aggcaaccaa aaagaaagga agtcgaagta tctctgtttg cagacgatat gattctatat 1620 

ctagaaaacc ccatgatctt ggcccaaaag ctcctagatc tgataaacaa cttcagctaa 1680 

ctttcaggag acaaaatcaa tatacaaaat atggtagcat ttttatacac caacgacatc 1740 

caagctgaga gccaaatcaa gaatgcaatc ctattcacaa ttgccacaaa aagaataaaa 1800 

tacctaggaa tacagctaac cagggagatg aaagatctct acaacaaaaa ttacaaaaca 1860 

ctgctgaaag aaatcagaga tgacacaaat ggaaaaacat tccatactta tggataggaa 1920 

gaatcaatat tgttaaaatg gccatactac ccaaagcaat ttatagattc aatgctattc 1980 

ctatcaaact accaataaca ttcttcacag aatcagaaaa aaaaagcatt aaaatttatt 2040 

tgaaaccaaa aaagagccca aaaagccaaa gcaatcctaa gcaaaaagaa caaagctgga 2100 

ggcatcgcat tacccaactt caaactatac tacagggcta cagtaaccaa aactgcatga 2160 

tactggtaca aaagcatggt gctggtacaa aagcagacac atagatcaat ggaacagaat 2220 

agagggccca gaaataaagc tacacaccta caaccatcta atctttgaca aagttgacaa 2280 

aaatacgcaa tggggaaaga attccccatt cagtaagtgg tactgggata actagctagc 2340 

catatgcaga ggattgaaac tgaaccactt ccttacacca tatgcaaaaa tcaactcaag 2400 

atggattaaa gacttaaatg taaaacccca aactataaaa actctggaag ataacctagg 24 60 

caataccatt ctggacatag gaacggaaaa agatttcatg acaaagatcc caaaaataat 2520 

tgtaacgaaa gcaaaaattg acaaatggga catgattaaa cagaattacc atttgactca 2580 

gcaatcccat tattggttat atacccaaag gaatctaaat cattctgtca taaagacata 2640 

tatacacaaa tgttcacggc agcactatac acaatcgcaa agtcagggaa tcaaactaaa 2700 

tgtccatcag tggtagaaag gataaagaaa atgtggtggc agggagtggt ggctcatgtc 27 60 

tgtaatccca gcactttggg aggctgaggc gggtggttca cctgaggtca ggagtttgag 2820 

accagcctgg ccaacatggc gaaactccgt ctccgctaaa aatacgaaaa ttagccaggc 2880 

gtggtggcga gcacctgtca tcccagctac ttgggaggcc taggcgtgag aatcgcttga 2940 

acctggaagg tggtggttgc agtgagccga gatcctgcca ctgcactcca gcctgggcaa 3000 

ccaagcgaga ctctgcctta aaaaaaaaaa aaagaaaatg tggcacatat acaccatgga 3060 

atactatgca gccataaaaa agaatgggat catgtcctgt gcagcaacgt ggatggagct 3120 

ggaagccatt atcctaaatg aactcactca gaaacagaaa accaaatacc acatgttctc 3180 

acttataagt agaagctaaa cattgagtac acatggatac aaagaaggga accgcagaca 3240 

ctggggccta cctgaggtcg gagcatggaa ggagggtgag gatcaaaaaa ctacctatct 3300 

ggtactatgc tttttatctg gatgatgaaa taatctgtac aacaaaccct ggtgacatgc 3360 

aatttaccta tatagcaagc ctacacatgt gcccctgaac ctaaaaaaaa agttaaaaga 3420 

aaaacgtttg gattattttc cctctttcga acaaagacat tggtttgccc aaggactaca 3480 

aataaaccaa cgggaaaaaa gaaaggttcc agttttgtct gaaaattctg attaagcctc 3540 

tgggccctac agcctggaga acctggagaa tcctacaccc acagaacccg gctttgtccc 3600 
caaagaataa aaacacctct ctaaaaaaaa aaaaaaaa 3638 

<210> 262 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_017523 
<400> 262 

ttggaaaagg aaaggtacta caaattcaaa agatttcact tttaacactg gcattcctgc 60 

<210> 263 
<211> 2461 
<212> DNA 
<213> Homo sapiens 



<300> 
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<308> NM_018410 
<400> 263 

atgctgggta cgctgcgcgc catggagggc gaggacgtgg aagacgacca gctgctgcag 
aagctcaggg ccagtcgccg ccgcttccag aggcgcatgc agcggctgat agagaagtac 
aaccagccct tcgaggacac cccggtggtg caaatggcca cgctgaccta cgagacgcca 
cagggattga gaatttgggg tggaagacta ataaaggaaa gaaacaaagg agagatccag 
gactcctcca tgaagcccgc ggacaggaca gatggctccg tgcaagctgc agcctggggt 
cctgagcttc cctcgcaccg cacagtcctg ggagccgatt caaaaagcgg tgaggtcgat 
gccacgtcag accaggaaga gtcagttgct tgggccttag cacctgcagt gcctcaaagc 
cctttgaaaa atgaattaag aaggaaatac ttgacccaag tggatatact gctacaaggt 
gcagagtatt ttgagtgtgc aggtaacaga gctggaaggg atgtacgtgt gactccgctg 
ccttcactgg cctcacctgc cgtgcctgcc cccggatact gcagtcgtat ctccggaaag 
agtcctggtg acccagcgaa accagcttca tctcccagag aatgggatcc tttgcatcct 
tcctccacag acatggcctt agtacctaga aatgacagcc tctccctaca agagaccagt 
agcagcagct tcttaagcag ccagcccttt gaagatgatg acatttgcaa tgtgaccatc 
agtgacctgt acgcagggat gctgcactcc atgagccggc tgttgagcac aaagccatca 
agcatcatct ccaccaaaac gttcatcatg caaaactgga actgcaggag gaggcacaga 
tataagagca ggatgaacaa aacatattgc aaaggagcca gacgttctca gaggagctcc 
aaggagaact tcataccctg ctctgagcct gtgaaaggga caggggcatt aagagattgc 
aagaacgtat tagatgtttc ttgccgtaag acaggtttaa aattggaaaa agcttttctt 
gaagtcaaca gaccccaaat ccataagtta gatccaagtt ggaaggagcg caaagtgaca 
ccctcgaagt attcttcctt gatttacttc gactccagtg caacatataa tcttgatgag 
gaaaatagat ttaggacatt aaaatggtta atttctcctg taaaaatagt ttccagacca 
acaatacgac agggccatgg agagaaccgt cagagggaga ttgaaatccg atttgatcag 
cttcatcggg aatattgcct gagtcccagg aaccagcctc gccggatgtg cctcccggac 
tcctgggcca tgaacatgta cagagggggt cctgcgagtc ctggtggcct tcagggctta 
gaaacccgca ggctgagttt accttccagc aaagcaaaag caaaaagttt aagtgaggct 
tttgaaaacc taggcaaaag atctctggaa gcaggtaggt gcctgcccaa gagcgattca 
tcttcatcac ttccaaagac caaccccaca cacagcgcaa ctcgcccgca gcagacatct 
gaccttcacg ttcagggaaa tagttctgga atatttagaa agtcagtgtc acccagcaaa 
actctttcag tcccagataa agaagtgcca ggccacggaa ggaatcgtta cgatgaaatt 
aaagaagaat ttgacaagct tcatcaaaag tattgcctca aatctcctgg gcagatgaca 
gtgcctttat gtattggagt gtctacagat aaagcaagta tggaagttcg atatcaaaca 
gaaggcttct taggaaaatt aaatccagac cctcacttcc agggtttcca gaagttgcca 
tcatcacccc tggggtgcag aaaaagtcta ctgggctcaa ctgcaattga ggctccttca 
tctacatgtg ttgctcgtgc catcacgagg gatggcacga gggaccatca gttccctgca 
aaaagaccca ggctatcaga accccagggc tccggacgcc agggcaattc cctgggtgcc 
tcagatgggg tggacaacac cgtcagaccg ggagaccagg gcagctcttc acagcccaac 
tcagaagaga gaggagagaa cacgtcttac aggatggaag agaaaagtga tttcatgcta 
gaaaaattgg aaactaaaag tgtgtagcta ggttatttcg gagtgttatt tatcttccca 
cttgctctct gtttgtattt ttgttttgtt tttgattctt gagactgtga ggacttggtt 
gacttctctg cccttaaagt aaatattagt gaaattggtt ccatcagaga taacctcgag 
ttcttggtgt agaaattatg tgaataaagt tgctcaatta gaaaaaaaaa aaaaaaaaaa 
a 2461 

<210> 264 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_018410 
<400> 264 

agtgatttca tgctagaaaa attggaaact aaaagtgtgt agctaggtta tttcggagtg 60 

<210> 265 

<211> 1405 

<212> DNA 

<213> Homo sapiens 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
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<300> 

<308> NM_018455 
<400> 265 

cacctcgctc gcagcctccc cagcgcagca gcccggctgt gggcctgcgg cagccgggtc 60 

ttcctggtcc ccacctcctg gggccgacgg gcggcaggaa ggggctcggc gggacgcgcc 120 

gtcagggacc tgaggaggaa caacggaacg cgttcggaac ggcctggact cccgagactc 180 

acccgactcg tggccacacc gggagaactg aagcggcagt agccggcgga gacgcccgac 24 0 

ccgaaggccg gctgctaggg agcagacagc tgaaccgctt gccagacgcc gaaacccagt 300 

gacgccctcc accgctccac cgtgctcccg gctccccgcc cccgccgccc gcgggcccca 360 

aggcgcatgc gccgcctgtc ctggaggggc ccatttccgt ccgtcgtggg gggaggcaca 420 

gtgagtccac tggggcacgg cagcgtctaa gccacaagcc gagcacataa gccaggtcct 4 80 

aacggagcct atgtgtaagt ccactactgg tgcaaggttg cacacttcta agaagagcgg 540 

cgtggggggc tcggcgacct tcgcttcagt cgctcccccg tgcagtcccc tgtgcccaag 600 

acacagcctg atgcttgtgc tccggtgggc ggagcttgga ggcggcggga actgcaattg 660 

gtggctttga aggcgcggcg agcgggaaca gctcttgagg agtgagactg caggagatgt 720 

gggccgtgcc aaagagatgg atgagactgt tgctgagttc atcaagagga ccatcttgaa 780 

aatccccatg aatgaactga caacaatcct gaaggcctgg gattttttgt ctgaaaatca 840 

actgcagact gtaaatttcc gacagagaaa ggaatctgta gttcagcact tgatccatct 900 

gtgtgaggaa aagcgtgcaa gtatcagtga tgctgccctg ttagacatca tttatatgca 960 

atttcatcag caccagaaag tttgggatgt ttttcagatg agtaaaggac caggtgaaga 1020 

tgttgacctt tttgatatga aacaatttaa aaattcgttc aagaaaattc ttcagagagc 1080 

attaaaaaat gtgacagtca gcttcagaga aactgaggag aatgcagtct ggattcgaat 1140 

tgcctgggga acacagtaca caaagccaaa ccagtacaaa cctacctacg tggtgtacta 1200 

ctcccagact ccgtacgcct tcacgtcctc ctccatgctg aggcgcaata caccgcttct 12 60 

gggtcaggag ttagaagcta ctgggaaaat ctacctccga caagaggaga tcattttaga 1320 

tattaoogaa atgaagaaag cttgcaatta gtgaacatga aaggaaaata aaaattcctc 1380 
acagtcaaaa aaaaaaaaaa aaaaa 1405 

<210> 266 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_018455 
<400> 266 

ccgacaagag gagatcattt tagatattac cgaaatgaag aaagcttgca attagtgaac 60 

<210> 267 
<211> 927 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_0184 65 
<400> 267 

ggcagcgggc gaaaggagcc ggggcctgga ggtttgcgta ccggtcgcct ggtcccggca 60 

ccagcgccgc ccagtgtggt ttcccataag gaagctcttc ttcctgcttg gcttccacct 120 

ttaacccttc cacctgggag cgtcctctaa cacattcaga ctacaagtcc agacccagga 180 

gagcaaggcc cagaaagagg tcaaaatggg gtttatattt tcaaaatcta tgaatgaaag 240 

catgaaaaat caaaaggagt tcatgcttat gaatgctcga cttcagctgg aaaggcagct 300 

catcatgcag agtgaaatga gggaaagaca aatggccatg cggattgcgt ggtctcggga 360 

attcctcaaa tattttggaa ctttttttgg ccttgcagcc atctctttaa cagctggagc 420 

gattaaaaaa aagaagccag ccttcctggt cccgattgtt ccattaagct ttatcctcac 480 

ctaccagtat gacttgggct atggaaccct tttagaaaga atgaaaggtg aagctgagga 540 

catactggaa acagaaaaga gtaaattgca gctgccaaga ggaatgatca cttttgaaag 600 

cattgaaaaa gccagaaagg aacagagtag attcttcata gacaaatgaa atcatgctta 660 
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ccaatcaaat ctcaaagcac agaattattg acttgaatca tggtttttac agttttttaa 720 

atgctcaaga ttttgatatt atagatttta ttttaaaata ttaaaatgca agatagtttt 780 

gagctatttt aaaataaaat ttataacatt caacacaaaa tcatggaggt gctctaaata 840 

acttttagat ttcctctctc tgtgtgcatt accaatatct aagtgtaaaa ttaataaatt 900 

gttttgaatt cctggaaaaa aaaaaaa 927 

<210> 268 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_0184 65 
<400> 268 

ggaacagagt agattcttca tagacaaatg aaatcatgct taccaatcaa atctcaaagc 60 

<210> 269 
<211> 1047 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_018487 
<400> 269 

cccacttctc cagccagcgc cccagcoctc ccgccgcccg ctcgcaggtc ccgaggagcg 60 

cagactgtgt ccctgacaat gggaacagcc gacagtgatg agatggcccc ggaggcccca 120 

oagcacacoc acatcgatgt gcacatccac caggagtctg ccctggccaa gctcctgctc 180 

acctgctgct ctgcgctgcg gccccgggcc acccaggcca ggggcagcag ccggctgctg 240 

gtggcctcgt gggtgatgca gatcgtgctg gggatcttga gtgcagtcct aggaggattt 300 

ttctacatcc gcgactacac cctcctcgtc acctcgggag ctgccatctg gacaggggct 360 

gtggctgtgc tggctggagc tgctgccttc atttacgaga aacggggtgg tacatactgg 420 

gccctgctga ggactctgct aacgctggca gctttctcca cagccatcgc tgccctcaaa 480 

ctttggaatg aagatttccg atatggctac tcttattaca acagtgcctg ccgcatctcc 540 

agctcgagtg actggaacac tccagccccc actcagagtc cagaagaagt cagaaggcta 600 

cacctatgta cctccttcat ggacatgctg aaggccttgt tcagaaccct tcaggccatg 660 

ctcttgggtg tctggattct gctgcttctg gcatctctga cccctctgtg gctgtactgc 720 

tggagaatgt tcccaaccaa agggaaaaga gaccagaagg aaatgttgga agtgagtgga 780 

atctagccat gcctctcctg attattagtg cctggtgctt ctgcaccggg cgtccctgca 840 

tctgactgct ggaagaagaa ccagactgag gaaaagaggc tcttcaacag ccccagttat 900 

cctggcccca tgaccgtggc cacagccctg ctccagcagc acttgcccat tccttacacc 960 

ccttccccat cctgctccgc ttcatgtccc ctcctgagta gtcatgtgat aataaactct 1020 
catgttattg ttcccaggaa aaaaaaa 1047 

<210> 270 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_018487 
<400> 270 

aaccaaaggg aaaagagacc agaaggaaat gttggaagtg agtggaatct agccatgcct 60 

<210> 271 
<211> 2280 
<212> DNA 
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<213> Homo sapiens 
<300> 

<308> U17077 



<400> 271 

ccgcccgcca ccagctacgc cccgtccgac gtgccctcgg gggtcgcgct gttcctcacc 60 

atccctttcg ccttcttcct gcccgagctg atatttgggt tcttggtctg gaccatggta 120 

gccgccaccc acatagtata ccccttgctg caaggatggg tgatgtatgt ctcgctcacc 180 

tcgtttctca tctccttgat gttcctgttg tcttacttgt ttggatttta caaaagattt 240 

gaatcctgga gagttctgga cagcctgtac cacgggacca ctggcatcct gtacatgagc 300 

gctgccgtcc tacaagtaca tgccacgatt gtttctgaga aactgctgga cccaagaatt 360 

tactacatta attcggcagc ctcgttcttc gccttcatcg ccacgctgct ctacattctc 420 

catgccttca gcatctatta ccactgatgc acaggcgcca ggccaagggg gaaatgctct 4 80 

ttgaaagctc caattattgg tccccaaaag cagcttccaa cgtttgccat ctggatgaca 540 

aacggaagat ccactaaaac gtccacggga ttaacagaac gtccttgcag actgagcgat 600 

gacaccacac tttgtttgga catttaaatt cactctgctg aataggagga agcttttctt 660 

tttcctggga aaacaactgt ctcttggaat tatctgacca tgaacttgct cttctagaca 720 

actcacatca aagccctcac tccactaatg gagaatccta gccccactaa tgccaagtct 780 

gtttggggat tttgcctcag ctatgggctt ccctagagta ggtctagggg aatactcagt 840 

ctgatctttt ttttgtttgt tttattttgt tttttttgag acggagtctc gctcttcctc 900 

caaggctgga gtgcagtgac gcgatctcca ctcactgcag gctccgcctc ccgggttccc 960 

gccattctcc tgcctcagcc tcccgagtag ccgggactac aggcgcccac caccatgccc 1020 

ggctaattta gttgtatttt tagtagagat- ggggtttcac cgtattagcc aggatggtct 1080 

cgatctcctg acctcgtgat ccgcccgcct cggcctccca aagtgctggg attacaggcg 1140 

tgagccaccg tgcccggcct gattctctta aaattgaaga ggtgctgcca aggccttcag 1200 

atctaacgca gatgcataga ccttgttcct ggtacttgtt cagcctgtgc tggggagccg 1260 

tggtcccgag ttccctggga ggctgacagg gtcaagccac cctgcccacc accctcccac 1320 

ttcccctccc ctttcctctc cagcattagg attcaaggga aatctgcatg aagccaattt 1380 

tgagggtaga cgtgtgggga aaataaatca ttatacagta agacctgggg cttgaggggt 1440 

ggggaatggg gagggaaggg catagcctgc tcctccatga gtctgacatc tcggaaactg 1500 

agcagctgcc ggacgcctgg gtcaggaatc caagacccca cctcttaagg actggttcct 1560 

cagaaagcac cctcagggaa aaaggtgaaa acattacatc cgtggattct cctgccacaa 1620 

ccgcattgga agaaaaggct gccgcaacat ctcagcgagg agtgaaggac ccatgtccca 1680 

ggaaccgcgc tgcgccacct gcactcaccc ccctcacatt ctcttaagca cccggtggcc 1740 

ctccgaggct ggcggaatgg tggtgcccac ggggttgggc aagggctcac caggacctca 1800 

acgggcaaag ttgtgcacac taaaatatca aatcaaggtg cttggtttta aagtaaatgt 1860 

ttttctaaag aaagctgtgt tcttctgttg acccagacga atagggcaca gccctgtaac 1920 

tgcacgtgcc ttctgtcatt gggaatgaaa taaattatta cgagaaaggg acttgtccta 1980 

actggtttga ggccttacag ttttgtatct acatttttcc cctcctgggg tttgcgggga 2040 

cagggacaga actacaggag tcatgggaaa gaaaattctg gcttcactac tgctcactgc 2100 

tcactttctg atcactctga tacttttttt tttttttttt ttttgcaacc tgataccttg 2160 

aaaagcttct atgtgtctct ccttttgttg cctggcagct gtctaggatg atcactgatt 2220 

actatttact aagtagccac atgcaaataa aagttgtttg gtaaaatgga aaaaaaaaaa 2280 

<210> 272 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> U17077 
<400> 272 

tcaccaggac ctcaacgggc aaagttgtgc acactaaaat atcaaatcaa ggtgcttggt 60 

<210> 273 
<211> 2554 
<212> DNA 
<213> Homo sapiens 
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<300> 

<308> X87949 
<400> 273 

aggtcgacgc cggccaagac agcacagaca gattgaccta 
gagagggaag cgccgcggcc tgtatttcta gacctgccct 
tgtgaccccg ggcccctgcc gcctgcaagt cggaaattgc 
cctgtggctg gactgcctgc tgctgcccaa ctggctggca 
gccgcgatgc tgctgctgct cagcgcggcg cgggccgagg 
gtgggcacgg tggtcggcat cgacttgggg accacctact 
aacggccgcg tggagatcat cgccaacgat cagggcaacc 
gccttcactc ctgaagggga acgtctgatt ggcgatgccg 
aaccccgaga acacggtctt tgacgccaag cggctcatcg 
tctgtgcagc aggacatcaa gttcttgccg ttcaaggtgg 
tacattcaag ttgatattgg aggtgggcaa acaaagacat 
gccatggttc tcactaaaat gaaagaaacc gctgaggctt 
catgcagttg ttactgtacc agcctatttt aatgatgccc 
gctggaacta ttgctggcct aaatgttatg aggatcatca 
attgcttatg gcctggataa gagggagggg gagaagaaca 
ggcggaacct tcgatgtgtc tcttctcacc attgacaatg 
actaatggag atactcatct gggtggagaa gactttgacc 
atcaaactgt acaaaaagaa gacgggcaaa gatgtcagga 
aaactccggc gcgaggtaga aaaggccaag gccctgtctt 
gaaattgagt ccttctatga aggagaagac ttttctgaga 
gaagagctca acatggatct gttccggtct actatgaagc 
gattctgatt tgaagaagtc tgatattgat gaaattgttc 
attccaaaga ttcagcaact ggttaaagag ttcttcaatg 
ataaacccag atgaagctgt agcgtatggt gctgctgtcc 
gatcaagata caggtgacct ggtactgctt catgtatgtc 
actgtaggag gtgtcatgac caaactgatt ccaagtaata 
tctcagatct tttctacagc ttctgataat caaccaactg 
ggtgaaagac ccctgacaaa agacaatcat cttctgggta 
cctcctgctc ctcgtggggt cccacagatt gaagtcacct 
attcttcgag tgacagctga agacaagggt acagggaaca 
aatgaccaga atcgcctgac acctgaagaa atcgaaagga 
tttgctgagg aagacaaaaa gctcaaggag cgcattgata 
tatgcctatt ctctaaagaa tcagattgga gataaagaaa 
tctgaagata aggagaccat ggaaaaagct gtagaagaaa 
caccaagatg ctgacattga agacttcaaa gctaagaaga 
caaccaatta tcagcaaact ctatggaagt gcaggccctc 
acagcagaaa aagatgagtt gtagacactg atctgctagt 
ggactcagga acttttgtta ggaaaaaatt gaaagaactt 
atcttcacct cagagtggag ttgaactgct atagcctaag 
ttagcagttg ctcacatgtc tttgggtggg gggggagaag 
agcgggtaaa aaacctgggt tagggtgtgt gttcaccttc 
tgggtcatgt gcatctggtg taggaagttt tttctaccat 
tgttatttac actggtcaaa aaaaaaaaaa aaaa 2554 

<210> 274 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> X87949 



ttggggtgtt 
tcgcctggtt 
gctgtgctcc 
agatgaagct 
aggaggacaa 
cctgcgtcgg 
gcatcacgcc 
ccaagaacca 
gccgcacgtg 
ttgaaaagaa 
ttgctcctga 
atttgggaaa 
aacgccaagc 
acgagcctac 
tcctggtgtt 
gtgtcttcga 
agcgtgtcat 
aggacaatag 
ctcagcatca 
ccctgactcg 
ccgtccagaa 
ttgttggtgg 
gcaaggaacc 
aggctggtgt 
cccttacact 
cagtggtgcc 
ttacaatcaa 
catttgatct 
ttgagataga 
aaaataagat 
tggttaatga 
ctagaaatga 
agctgggagg 
agattgaatg 
aggaactgga 
ccccaactgg 
gctgtaatat 
aagtctcgaa 
cggctgttta 
aagaattggc 
aaaatgttct 
aagtgacacc 



tcgcgagtgt 
cgtggcgcct 
tgtgctacgg 
ctccctggtg 
gaaggaggac 
cgtgttcaag 
gtcctatgtc 
gctcacctcc 
gaatgacccg 
aactaaacca 
agaaatttct 
gaaggttacc 
aaccaaagac 
ggcagctgct 
tgacctgggt 
agttgtggcc 
ggaacacttc 
agctgtgcag 
agcaagaatt 
ggccaaattt 
agtgttggaa 
ctcgactcga 
atcccgtggc 
gctctctggt 
tggtattgaa 
taccaagaac 
ggtctatgaa 
gactggaatt 
tgtgaatggt 
cacaatcacc 
tgctgagaag 
gttggaaagc 
taaactttcc 
gctggaaagc 
agaaattgtt 
tgaagaggat 
tgtaaatact 
tgtaattgga 
ctgcttttca 
catcttaaaa 
atttaacaac 
aataaatgtt 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 



<400> 274 

aactttcctc tgaagataag gagaccatgg aaaaagctgt agaagaaaag attgaatggc 
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<211> 1359 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contigl632 



<400> 275 

ttttaagaca gttacctgtt gtgctgctgt tacaatatat aatgaaacca agtcagggga 60 

gtgaatttat caatcttttg atgtaaagta aaaacgtagt tcacacttca ggagagaact 120 

tcatagcaca atgtctttct ataagatatt tttaatgatt tagtatttta caacatttgt 180 

ttaccatatt ttgatatacc atttttttct atctgcccag ttttattaaa aaaactatat 240 

attattttct aaagaaacaa tcatattttt atacaaaatt atgttttcag gtaacgaaat 300 

agatgtaggg tacagtggaa cataagcagt gttacccctg gctgggagtc agtattatac 360 

aacaaatggt gagctggaac atgccctgtc tgtgctgtcc ctcctgtgct gggtcgcgga 420 

tgtgtaggca acattgcctt atcacgctag gttcacctga cactttaaaa ggaaaaaaag 4 80 

ttccatagag ttctgtggtc acaaaattgt tttgctttta tcaaatactt taatagaacc 54 0 

aaagttgcag atattggaat gtatggaagt atctcagtct ctgcataaga ggattaaagt 600 

atgaaaggat catttaatga ctgttttact tataagtcat taagtaatcc accatttctt 660 

atggatgatg cttaagcctg gtgaggtttg tactctaagg agcccagatc ataatgcagt 720 

gcatttcctt agcccttaga gtttcttgca aacatttaaa aaaagacata tttaagaaag 780 

aaagataaag aaaaaacata tttaattact gtaaacaggt actgctttat gtttattttc 840 

tctctacttc aaccaaaatc agatctttga ggttttgctg acattgttgg tggttttgca 900 

catgttcttt ctaattggat ttatgaatag ttctatgggt tttcaaagat gaatcatgct 960 

aagaacactt ctgctttttg atccactgtt tgcagcagaa ttatatatat gtataggaaa 1020 

aatccacttt gaataatcca tgttttgtat ttggaaattg tttttaaaaa taaaaaggaa 1080 

aggaaatata taaagctgtt atttattctg catttcttac atatctatcg cttgtcagta 1140 

tacccgtttt ggtatatatt gcctctgcac atctacattt gtatatgcaa cagtgagctt 1200 

tatatctaca taaactgtaa ataatccttt ctgtgaaagg atcatcatat caagatgata 1260 

ccaaaagtat gtaaaaagaa acctgcatta ttttgtaatt atttcttata gatatttcat 1320 ■ 
ggtaagatta gcagtcaata aagttacttt tttgccttt 1359 

<210> 276 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contigl632 



<400> 276 

gggttttcaa agatgaatca tgctaagaac acttctgctt tttgatccac tgtttgcagc 60 

<210> 277 
<211> 994 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig3464 
<400> 277 

tgaatgtata tattaagact gtagctgaat tgcacatgaa atcagattgc caacttcttg 60 

actttcaatg ttagacattt atccttaagt tgtgagcgat atatgtagca tgctgtgaaa 120 

tgtctgttat agctctttaa ttcatcagta ttaatacaga attatcattt gcgtttcttg 180 

gtacttttta ttcaatgtaa tcagaagctg tgatgttttg cctttgtagt cctgtgcttt 240 

gttactgtaa tttttttttt tttttttacg aagcacgtga ctggactaat gtaaggcaga 300 

tgacgtgatc tttaagactg ctatatatat cagtctctta ctctataagg ttttaaatta 360 

gaataagctt ttatcaaata gataattgat gcaatttagg attcacgcaa gtttcagtgt 420 

caaatggcgg tcttatagtt tcaattctga aaatagcaaa cttaataaac agccacttta 4 80 

aacttgttct ggcaaaccag accctgctgt agatatagtc taaggtagtt aaccatataa 54 0 
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gccttttcaa ctcttaatgc cctccacatg aatcagcagt taagaaggtt ctagaaccca 600 

tgaaagcttt tgtatgtatt actaggtttt gtttttctta tgtttgctga ttttacagtt 660 

ctgactaaag ctgacctaaa tggatcagtt tatgtgtaat attctagtgc tttaatgact 720 

ctttttttct ttggagggag ggtaacatta tttggacaga tgcagaagga actgttagtg 780 

agtcaagaca aacacatctg aaataaagga actgtgtatt aacatgttaa caattcataa 840 

ctgcactttt tatgacattt tgaaaatcta tttataggta cagaacaatg ggttttgtta 900 

aactgtatca catttatact tgcagaaatt tatttcattg ttattagtag gaattttatt 960 

ggttcaataa aattggcaaa actgaacacc aaaa 994 

<210> 278 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig34 64 
<400> 278 

ctgctgtaga tatagtctaa ggtagttaac catataagcc ttttcaactc ttaatgccct 60 

<210> 279 
<211> 423 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contigl4683 
<400> 279 

tatgttatgg atatcttatt ttagagtaag aatataaggc atagccatat ttatgaaggt 60 
agtaatactc tactaatcaa tacttagaag tttttgttat gactaatctg aatgcttttt 120 
agtttttcct taatctagtt atgttggtaa tttataagtc agttttcaga ttaggaaaga 180 
aggtatttga gggtgttcca tttccactga atagtaagat gatgcttact tagatttcca 240 
cagctgtttg aaagctctgt atttggctat aacggaaaac tttgttaggg atgcttgatg 300 
ttttgtgttt tgtttctaaa ggaagacagt gttttgttcc ttctttagaa aacttgaaga 360 
atagaataat gagtccagga ttaatttggg ataaagtctt ttacttcata aattctgatt 420 

ctg 423 

<210> 280 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contigl4683 
<400> 280 

aggaagacag tgttttgttc cttctttaga aaacttgaag aatagaataa tgagtccagg 60 

<210> 281 
<211> 391 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig28552 
<400> 281 

atgccattga tgtgaagaag gtgtctgtgg aagactttct tactgacctg aataacttca 60 
gaaccacatt catgcaagca ataaaggaga atatcaaaaa aagagaagca gaggaaaaag 120 
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aaaaacgtgt cagaatagct aaagaattag cagagcgaga aagactcgaa cgccaacaaa 180 

agaaaaagcg tttattagaa atgaagactg agggtgatga gacaggagtg atggataatc 240 

tgctggaggc cttgcagtcc ggggctgcct tccgcgacag aagaaaaagg acaccgatgc 300 

caaaagatgt tcggcagagt ctcagtccaa tgtctcagag gcctgttctg aaagtttgta 360 

accatggtaa taaaccgtat ttataaattg c 391 

<210> 282 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig28552 
<400> 282 

aagactttct tactgacctg aataacttca gaaccacatt catgcaagca ataaaggaga 60 

<210> 283 
<211> 450 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig28947 
<400> 283 

ctcatccaag gagctggggc agacttcatt gattctagag agacctgttt cagtgcctac 60 
tcatccctgc cctctggtgc cagcctcctt accatcacgg cttcactgag gtgtaggtgg 120 
gtttttctta aacaggagac agtctctccc ctcttacctc aacttcttgg ggtgggaatc 180 
agtgatactg gagatggcta gttgctgtgt tacgggtttg agttacattt ggctataaaa 240 
caatcttgtt gggaaaaatg tgggggagag gacttcttcc tacacgcgca ttgagacaga 300 
ttccaactgg ttaatgatat tgtttgtaag aaagagattc tgttggttga ctgcct'aaag 360 
agaaaggtgg gatggccttc agattatacc agcttagcta gcattactaa ccaactgatg 420 
gaagctctga aaataaaaga tcttgaaccc 450 

<210> 284 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig28947 
<400> 284 

agacagattc caactggtta atgatattgt ttgtaagaaa gagattctgt tggttgactg, 60 

<210> 285 
<211> 439 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig30875 
<400> 285 

agaaatcaat gacagttgac aggaagagag gacgcataca acaggcaaaa gaggaatgcc 60 
cagcagtctt ggtccttgcg gtgcaatact ggccttgagg ccaagtcagc aggggattcg 120 
tagtcactaa cttctaactg aggcagggaa gtaccatgtt ctggaaaagg tccaaagaaa 180 
caggaataga ggcagtgtag caagaggcag atttttggtg ccaaatagat ttgaatcctg 24 0 
gttctgcttc ttcctttgta gagtatgata ttggttcttt cctcccaaag ctattataaa 300 
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gactaaatat gtacacaaat ctttgggatg tctgacatat aaatgcttaa caataggtat 360 
ttgctggtat tattacaaat gaatttgctt atttttgagc cacttctatg tctgtccatt 420 
aaaccaaaat gtgttctgc 439 

<210> 286 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig30875 
<400> 286 

ggttctttcc tcccaaagct attataaaga ctaaatatgt acacaaatct ttgggatgtc 60 

<210> 287 
<211> 338 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig31221 
<400> 287 

gggaagttac actgcttcac accacaaggc cgtgggaaat cttggaggtt ctgtgccttt 60 
ctgtcacctc tactttttgc agctgtgatt gcactgtccc gcacatgtga ctacaagcat 120 
cactggcaag gaccctttaa atggtgaaaa tgggcagatg aatagcaata agtggacctt 180 
tgttactctt ctgagttaga aaaattctaa tttagtacac tctgaacaaa gcttattata 240 
cttacttaag atgtgttttg atttggtgtt cagaaagcaa cctgacaatg ataatactgt 300 
aactatgata aaattgagaa taaaaagatt ttatttag 338 

<210> 288 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig31221 
<400> 288 

aaatgggcag atgaatagca ataagtggac ctttgttact cttctgagtt agaaaaattc 60 

<210> 289 

<211> 417 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig31288 
<400> 289 

gaatcacttg agcccgggag gttgaggctg cagtgagctg tgtttatacc actgcactcc 60 

agcctgctgg gtaacagagc aagactccat ctcaaaaaga aaagaaaaaa tgctttgcta 120 

cataatgagg ccaggcaaaa aaaaaaaaag tcctgtggaa atcatataga caaacatttg 180 

caaagctgct actgccattg taccagtgtt aaaatgtgtt ctaccttgca tcttttactg 24 0 

atttttatga cagattttat attgtaacca tttgagaact ctgtaagtgc tatggcttcc 300 

ttaaactacg atttatcata tgctcccagt gtttactttg agactgaatg gcaaccagag 360 

aatgtaaaca accaaggtgc atctggttat gttttaaaat aaagattaat aaaagtt 417 

<210> 290 
<211> 60 
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<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig31288 
<400> 290 

ggcttcctta aactacgatt tatcatatgc tcccagtgtt tactttgaga ctgaatggca 60 

<210> 291 
<211> 394 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig31646 
<400> 291 

gctgctacac cccatgtaaa aagcggaaaa taaaatgaag attttccagc gcaagatgcg 60 

gtactggttg cttccacctt ttttggcaat tgtttatttc tgcaccattg tccaaggtca 120 

agtggctcca cccacaaggt taagatataa tgtaatatct catgacagta tacagatttc 180 

atggaaggct ccaagaggga aatttggtgg ttacaaactt cttgtgactc caacttcagg 24 0 

tggaaaaact aaccagctga atctgcagaa cactgcaact aaagcaatta ttcaaggcct 300 

tatgccagac cagaattaca cagttcaaat tattgcatac aataaagata aagaaagcaa 360 

gccagctcaa ggccaattca gaattaaaga ttta 394 

<210> 292 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig31646 
<400> 292 

gccagaccag aattacacag ttcaaattat tgcatacaat aaagataaag aaagcaagcc 60 

<210> 293 
<211> 357 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig37562 
<400> 293 

caattatttc aagtgcacct tattaacaaa agtatcagtg gatccaacat aaaattttat 60 

agtactaaat gtcaagccta actgtgaatt ttgttctgta tcttaagtaa atttatgata 120 

atgttctcga gctatcaaca aaatatatgt acttttgtga gctatgaatt ttctaattaa 180 

attttacatg ctataacatg atttttacat gaatgatact ttgtttataa ctatcaaatg 240 

tcagtatttt actacaattt tattataaag tgtacattat cactaaatga acttcgattt 300 

taaaaatcaa attagcttta gttgtatatt attttttaca aataaagata gacttgt 357 

<210> 294 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig37562 
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<400> 294 

atcaaatgtc agtattttac tacaatttta ttataaagtg tacattatca ctaaatgaac 60 

<210> 295 
<211> 351 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig37895 
<400> 295 

aatagagaca cctctaatta attaaagcgg atgccctccc cactcctccc aggatttgac 60 
tcggagcaca aactcttcac aaaccaaaat gtcaggacac catcgccagt gtccactggc 120 
cactgctgtt ggtgtgaggc agccaggagc ccctcagaac tagtaagtct gagaagaggc 180 
tgcacggggc ctaggagagg gagaaatgag cccgtccaag gtgaattcct tgattctcca 240 
ttgtgagtgc accaagaaca agcactccct ccgactgact ctcgcctacc aggatctgga 300 
acaccttcca ttaatttatt cgttcattca ataaatattt attgactgac t 351 

<210> 296 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig37895 
<400> 296 

ctctcgccta ccaggatctg gaacaccttc cattaattta ttcgttcatt caataaatat 60 

<210> 297 
<211> 418 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig38288 
<400> 297 

gacaagtaaa tgggggccgt tgggacggcg ggtgcctgga gggcagctct gggctcagcg 60 

ggcagtgctt agagcacagg cccctctgtt gggggatggg gaggagagca gtctgccctt 120 

gggagcgtag gccccaggga gacttctaaa gccccccctg tcgtctgctc ttcacccagc 180 

accacagagg cacctgctgc acacacaagc atctcactcg gcccacggag ggggccaggc 240 

ttcctttgcc tgaagctgtt ttgggaaggg tctccacaca ggcactgatc tcccaagctt 300 

tggtcatgat gtcttttacc atttgataat tttaaacatt gtttttaaac ccaaaacatt 360 

tagtggtccg ttgcctctga agatgtaaac aaacaaatac actatttctg ggaacatt 418 

<210> 298 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> Contig38288 
<400> 298 

tttagtggtc cgttgcctct gaagatgtaa acaaacaaat acactatttc tgggaacatt 60 

<210> 299 
<211> 413 
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<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig38901 
<400> 299 

tacatttttg tttaatgttg ggcctgaggt taactgtgac catggtccag cttgagtggc 60 
ttctggagca gccacatttt caaggactgt ccaaaagcca gccagttcag ggctcaggcc 120 
tcacccattg cccactcctg gggagaccat cacctggctc atcgtttcca ccaagagtgc 180 
cccacaggag tgccccacag acccgctgga ccagcctgct gcgggtcctg gccaggggtc 240 
tggctaacgg tgagggctga ctctgaactg tctctcagtc tccagaaagt gttcaagcct 300 
gttgtgttcc caaatctgat tcctcctatt gtcttgtaaa tcaaactcta agtgaaaact 360 
tcccatttgt cccttcaaag attttttttt attaaatggt tttttaagat cct 413 

<210> 300 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig38901 
<400> 300 

tgttcccaaa tctgattcct cctattgtct tgtaaatcaa actctaagtg aaaacttccc 60 

<210> 301 
<211> 434 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig40434 



gaatggtgaa agagagatgc cgtgttttga aagtaagatg atgaaatgaa tttttaattc ■ 60 

aagaaacatt cagaaacata ggaattaaaa cttagagaaa tgatctaatt tccctgttca 120 

cacaaacttt acactttaat ctgatgattg gatattttat tttagtgaaa catcatcttg 180 

ttagctaact ttaaaaaatg gatgtagaat gattaaaggt tggtatgatt tttttttaat 240 

gtatcagttt gaacctagaa tattgaatta aaatgctgtc tcagtatttt aaaagcaaaa 300 

aaggaatgga ggaaaattgc atcttagacc atttttatat gcagtgtaca atttgctggg 360 

ctagaaatga gataaagatt atttattttt gttcatatct tgtacttttc tattaaaatc 420 

attttatgaa atcc 434 

<210> 302 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig40434 
<400> 302 

aaggaatgga ggaaaattgc atcttagacc atttttatat gcagtgtaca atttgctggg 60 

<210> 303 

<211> 391 

<212> DNA 

<213> Homo sapiens 



<400> 301 



<300> 
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<308> Contig40552 
<400> 303 

caccaagccc tgctccggca cctcgaatcc ctggcgacca tgagtcacca gctccaagcc 60 

ttactgtgcc cccagaccaa gagctccatc ccccgccctc tgcagcgttt gtctagcgcc 120 

cttgcagctc cagagccccc tggcccagcc cgtgactcct ctttggggcc tacagatgaa 180 

gctggctctg agtgtccctt ccctagaaag gcctgaccct ccttacccac cagaacaggg 240 

gttttgatgc cctcactagt gttgaagcct gttccagaga gaggtgggac tgcaaggaga 300 

ggatggtcag ccctacccac ctgccctgtt tgagcttcct gtttgacaat gtttgctgtt 360 

gattttttgt tcaataaaga atttggtaaa a 391 

<210> 304 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig40552 
<400> 304 

tttgagcttc ctgtttgaca atgtttgctg ttgatttttt gttcaataaa gaatttggta 60 

<210> 305 
<211> 495 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig41413 
<400> 305 

aaatattctt aatagggcta ctttgaatta atctgccttt atgtttggga gaagaaagct 60 
gagacattgc atgaaagatg atgagagata aatgttgatc ttttggcccc atttgttaat 120 
tgtattcagt atttgaacgt cgtcctgttt gttgttagtt ttcttcatca tttattgtat 180 
agacaatttt taaatctctg taatatgata cattttccta tcttttaagt tattgttacc 240 
taaagttaat ccagattata tggtccttat atgtgtacaa cattaaaatg aaaggctttg 300 
tcttgcattg tgaggtacag gcggaagttg gaatcaggtt ttaggattct gtctctcatt 360 
agctgaataa tgtgaggatt aacttctgcc agctcagacc atttcctaat cagttgaaag 420 
ggaaacaagt atttcagtct caaaattgaa taatgcacaa gtcttaagtg attaaaataa 480 
aactgttctt atgtc 4 95 

<210> 306 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig41413 
<400> 306 

cagctcagac catttcctaa tcagttgaaa gggaaacaag tatttcagtc tcaaaattga 60 

<210> 307 
<211> 409 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig41538 
<400> 307 
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aaaaaaaaaa aaaaaaaaaa aaagagttgt tttctcatgt tcattatagt tcattacagt 60 

tacatagtcc gaaggtctta caactaatca ctggtagcaa taaatgcttc aggcccacat 120 

gatgctgatt agttctcagt tttcattcag ttcacaatat aaccaccatt cctgccctcc 180 

ctgccaaggg tcataaatgg tgactgccta acaacaaaat ttgcagtctc atctcatttt 240 

catccagact tctggaactc aaagattaac ttttgactaa ccctggaata tctcttatct 300 

cacttatagc ttcaggcatg tatttatatg tattcttgat agcaatacca taatcaatgt 360 

gtattcctga tagtaatgct acaataaatc caaacatttc aactctgtt 409 

<210> 308 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig41538 
<400> 308 

ctcatgttca ttatagttca ttacagttac atagtccgaa ggtcttacaa ctaatcactg 60 

<210> 309 
<211> 552 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig41887 
<400> 309 

ctgaagacta cgaccatgaa atcacagggc tgcgggtgtc tgtaggtctt ctcctggtga 60 

aaagtgtcca ggtgaaactt ggagactcct gggacgtgaa actgggagcc ttaggtggga 120 

atacccagga agtcaccctg cagccaggcg aatacatcac aaaagtcttt gtcgccttcc 180 

aagctttcct ccggggtatg gtcatgtaca ccagcaagga ccgctatttc tattttggga 240 

agcttgatgg ccagatctcc tctgcctacc ccagccaaga ggggcaggtg ctggtgggca 300 

tctatggcca gtatcaactc cttggcatca agagcattgg ctttgaatgg aattatccac 360 

tagaggagcc gaccactgag ccaccagtta atctcacata ctcagcaaac tcacccgtgg 420 

gtcgctaggg tggggtatgg ggccatccga gctgaggcca tctgtgtggt ggtggctgat 4 80 

ggtactggag taactgagtc gggacgctga atctgaatcc accaataaat aaagcttctg 540 
cagaatcagt gc 552 

<210> 310 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig41887 



<400> 310 

tactggagta actgagtcgg gacgctgaat ctgaatccac caataaataa agcttctgca 60 

<210> 311 
<211> 745 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig42342 
<400> 311 

gcagtaaaga caggacgcac ccatgtcaca agaggagcac aggcaggggt gttggtgttg 60 
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gggcagccct cagggtctcc agacccagcc ccactcacac agcagcctag gaaggaaggg 120 

cagagtccca ggtgtcagct ggtgggtctc ccaggagctg cccctccctg gaagtcacag 180 

gacaggaatg acagatcagg gaactgcagg aagctgccac ctctggggtc agaatatgcc 240 

cagcctgcgg gggctctcta tcggggtctt cgagagccag acagcctgcc ttgtgctgca 300 

tacctggctt tgctctgtgc agaacccagc acacgtgatt ttgtgtgaca tgccagcagc 360 

ctggctccca ggacaggagg cctgccctgg gggaggggct gcaggaggag ggggggcagg 420 

cacccatgag tctgtccagc cttgtcacag atgcatcgcc caagctgcgg tcctgatttc 480 

agctcacctc agagtaaatc agaataaact gcacccagac tttcacgaat gcatgttgac 540 

gctttcagtt cacccctttc tttgctaact ttcttcctat tttcttctaa tgcgagagct 600 

tattaattcc atatttatca ttttgaataa cttttctcct ttttagtaac aaaatgtact 660 

tcactcttag taaaatgtat ttactatttt agtaacaaaa atatacttgc ctaatcatgt 720 
ttaaaatata gtgatgtgaa aaatt 745 

<210> 312 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig42342 
<400> 312 

cacccagact ttcacgaatg catgttgacg ctttcagttc acccctttct ttgctaactt 60 

<210> 313 

<211> 398 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> , Contig43645 
<400> 313 

agttcaaagg cagataaatc tgtaaattat tttatcctat ctaccatttc ttaagaagac 60 

attactccaa aataattaaa tttaaggctt tatcaggtct gcatatagaa tcttaaattc 120 

taataaagtt tcatgttaat gtcataggat ttttaaaaga gctataggta atttctgtat 180 

aatatgtgta tattaaaatg taattgattt cagttgaaag tattttaaag ctgataaata 240 

gcattagggt tctttgcaat gtggtatcta gctgtattat tggttttatt tactttaaac 300 

attttgaaaa gcttatactg gcagcctaga aaaacaaaca attaatgtat ctttatgtcc 360 

ctggcacatg aataaacttt gctgtggttt actaatct 398 

<210> 314 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig43645 
<400> 314 

gaaaagctta tactggcagc ctagaaaaac aaacaattaa tgtatcttta tgtccctggc 60 

<210> 315 
<211> 478 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig44289 
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ctaaaaacaa cactcatcag tcttgggaaa tttgaacttt gatcaactta actaaagaag 60 

gaagggtagt aagaattttt caaatacaaa tatttgccaa ttcacagatg ataacattta 120 

aggccttcaa aagtaagggt ttttccttgt ttctccagtc agcttttgtc aactctaata 180 

gttttttcat aaacattttt tatttgtata attgcaacag tttaagaaat tatcacaact 24 0 

atttagaaac atttaaaatg ttctttttga tataagctat atacttggaa aaatacattg 300 

gtatctaaaa tttgaggtgt gttaagactg ctttttgttt taaaaaatgg tttacattca 360 

aatttttgaa gtgttttatg cttcatatgg ctaagttgta gtttggcaga gttaacagca 420 

taagaataaa catgctgtaa ttttaaaaga tgctttgaat aaaaatttat tttaattt 478 

<210> 316 
<211> 60 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> Contig44289 



<400> 316 

catcagtctt gggaaatttg aactttgatc aacttaacta aagaaggaag ggtagtaaga 60 

<210> 317 
<211> 556 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig44909 
<400> 317 

accatctggg atttctacag cctgggta'cc catagccaca ccaaggcttc tgggagattc 60 
tgcagggtca gctttccagg ctgttcccaa atagctc.cct gcctccccac tgcccctaaa 120 
gccacagcag aagagccatt catctcataa acaaaaagga agaggaaaga atgaggaagg 180 
accctgtgca aggttatttg caggcaggga tgggcttgta cctgacagca cccacccctg 240 
tgtggccccc aggccctcat caccctcaga cccctcctaa gcagttccct cattgctctt 300 
tggactaggc tgacagcagg aagagcaggg cccatgaccg ggtggaagtt cagttttggt 360 
gtctgcttca agagggggtt ttacactctg attccaggac aagcactctg aggcgggtgg 420 
gggagagaaa ccctggctct tcacccaggt ttcacacaca tgtaaatgaa acactatgtt- 480 
agtatctaac acactcctgg atacagaaca caagtcttgg cacatatgtg atggaaataa 540 
agtgttttgc aatctt 556 

<210> 318 
<211> 60 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> Contig44909 



<400> 318 

tcacccaggt ttcacacaca tgtaaatgaa acactatgtt agtatctaac acactcctgg 

<210> 319 
<211> 710 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> Contig45032 



<400> 319 

aaagataggc ttctaagtta aggcaaatca ttcattctgt cattaaacaa atacaaacca 60 
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ggcacctgtc atatgccaag tgatattcaa aatggcccat gtagaccttt gtgaagtatg 120 

tggcctaaca gacattaaac aaatgtctgt gaaactgaca taataaagta aggtaagtta 180 

tatgtgagac attctctttt tataataatt cctgtaaagc agtacttact taggtaatga 240 

tatcatactg ttttgtttta tatttttcct aagagctaaa acgtcatcct ctcttcagtg 300 

atgtggactg ggaaaatctg cagcatcaga ctatgccttt catcccccag ccagatgatg 360 

aaacagatac ctcctatttt gaagccagga atactgctca gcacctgacc gtatctggat 420 

ttagtctgta gcacaaaaat tttcctttta gtctagcctc gtgttataga atgaacttgc 480 

ataattatat actccttaat actagattga tctaaggggg aaagatcatt atttaaccta 540 

gttcaatgtg cttttaatgt acgttacagc tttcacagag ttaaaaggct gaaaggaata 600 

tagtcagtaa tttatcttaa cctcaaaact gtatataaat cttcaaagct tttttcatct 660 
atttattttg tttattgcac tttatgaaaa ctgaagcatc aataaaatta 710 

<210> 320 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Cbntig45032 
<400> 320 

ttaacctagt tcaatgtgct tttaatgtac gttacagctt tcacagagtt aaaaggctga 60 

<210> 321 
<211> 726 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig46218 
<400> 321 

atacatattg ctttagagag caggtaggtg gccatgtgtt cagcagtgtg tccttaagaa 60 

aataccatct ttctaagcca ctggaatttt tactttacta tttttaacat taatggatgt 120 

caggtcatca acctcaagtc tttacatatc catgtatatt ccatatatat tgtttatata 180 

ggcccaagtt tctccttaat tgggatctat atactaccag cacaacatca aaaacatgta 240 

attgaataca tcagagctat atatgtaagg aaatgactgg tgaccccatt atcatcattg 300 

ttgaattcat gttaagtaga ccctctaggg gaccataagg caattgagca cataacgaaa 360 

aatgatgcaa taagaatgta tgcactctct ttgccaaatg catgtgcttt tgtgtaacgt 420 

ggatgtaaac agaattgcag tgctgccgaa attcttgatc ttggctaaga gagtattttt 480 

ccccttgtaa ttatgactct gagataaaat tgccattttg aaatttccaa agtaacaact 540 

ttttttattt tatgaataaa cttgggattg caatttctct gatctgacaa tcaataactt 600 

taacaaagat ctaaataagt gtttcaagga aagttttcct aagcaaatgt aatattacct 660 

catttgggca tcattactct gttaattcta tatcaaagga aataaacttg ctacttgcac 720 
taaatg 726 

<210> 322 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig4 6218 
<400> 322 

accataaggc aattgagcac ataacgaaaa atgatgcaat aagaatgtat gcactctctt 60 

<210> 323 
<211> 580 
<212> DNA 
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<213> Homo sapiens 



<300> 

<308> Contig47096 



<400> 323 

ggtggtctct catccttgtg tgctgctctg 

agatccttcc agactcatct gtcagagccc 

atgcctttgg gacattagga ctgcagcctt 

ttttgtcacc tggacacact catttgttag 



ctaagagatg tccaaggcgg agccggggca 60 

caagcccttt agacccagag cccaaggacc 120 

tgcttctgtg tattttggag ttttggtgac 180 

ccatagtggg ttcccttggt cagcaacagt 240 



gcatgtacct ctggatgtca tctgaggtga 

gaggggagtt aggagttgct ggactggatg 

acatccaggg atcagggcat gagtgggggc 

ccgcatgggc tgaggtgtag ctgattggct 

ccatgttgca gatattttcc cgagttcccc 

gcaacactgt gtccagcatt ctttgcaata 



gaccaccgag gccttttctc tctgtgtaca 300 

cattacgagg actggggaca gggtagaggg 360 

aaccccccgg cctctgccct ggcatggtct 420 

gccacatttc ggccatgctg gctggcgtgc 4 80 

agaatggatg gtattgaatc tcagccacat 540 
aatacttttt 580 



<210> 324 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig47096 



<400> 324 

atattttccc gagttcccca gaatggatgg tattgaatct cagccacatg caacactgtg 

<210> 325 
<211> 632 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> Contig47563 



<400> 325 

gccatctagt ctgtggtttt ctgttgaagc 
gtagttataa accactttcc tgttgaaagc 
tagcaatgat agattttgtt taagtcccct 
acacttcctg gcagtgatta aggagtgtgt 
aacccgaata tggaattcag attgtttctg 
acaaacaaac ataagggaac aaacagcaac 
ttttgctgtg ataaatgaat acggtactct 
aaactgcaag tgatttaaaa attagagaat 
cagaaaacct caagtgaatt ttgtatgtaa 
tttaagcaaa caattagctg aaaacttggt 
gcacccatca gaaaataaag ggcattaaat 



agtctgaatt gactaaaaca gtcacttgga 60 

agaacatgct gattcaactg ttttgttcaa 120 

acactttctt atttctaaat gatcaagagt 180 

atctaacaga aaaaatatat ataccctgtg 240 

ccctcagtat catacttaaa aaacaagcat 300 

cataacaaaa acaaaccttt aaaggtgggt 360 

gaaggagaaa aaagtttctc aaatgagctt 420 

ataattctta aagctattga aagtttcaac 4 80 

atgaaatctt gaatgtaagt tctgtgattc 540 

attgttgtag tttatgtagt aagtgacttg 600 
tg 632 



<210> 326 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig47563 



<400> 326 

agcaaacaat tagctgaaaa cttggtattg ttgtagttta tgtagtaagt gacttggcac 60 
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<210> 327 
<211> 540 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig48913 
<400> 327 

accagagggt gtcccttttc cacagtaatg ggatcggctg gtgtgccttc agggaggaag 60 

agggaggtgg tcaagcttga aaaactggct ttaggatggt tctgactttg ttctccctcc 120 

ccaagtgttc tcaacctcca ttctgcagtg ttcagagttt tagggaaagg gtttgggtgc 180 

cccagcatcc aggtgttgtg tggcttagcg catgtgaagt gaaaaccttc tggggttgtt 24 0 

tggaagcagc tttctggttc ttgtgattgt atcctgaggt cccagaaccc tattctccca 300 

cgaggatcct cagtgaccat ggtggccaca cgcctggcca gcctgctggc tcctgggtga 360 

gctgaagaac cttgcctgtg gcacttttcg agggtgagct ggaaccgaga gaacatggtc 420 

cccgtgctgg gactcatgcg ggtcatttcc tgccggcctg gtttcgcctg gtcgtgtctt 480 

tatgagcacc atgtaagcct ccttgtattg agataattgg gcattaaaca ttaaactgca 540 

<210> 328 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig48913 
<400> 328 

tatgagcacc atgtaagcct ccttgtattg agataattgg gcattaaaca ttaaactgca 60 

<210> 329 

<211> 534 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig49169 
<400> 329 

cctaatgtta acatttttaa aaatacatat ttgggactct tattatcaag gttctaccta 60 

tgttaattta caattcatgt ttcaagacat ttgccaaatg tattaccgat gcctctgaaa 120 

agggggtcac tgggtctcat agactgatat gaagtcgaca tatttatagt gcttagagac 180 

caaactaatg gaaggcagac tatttacagc ttagtatatg tgtacttaag tctatgtgaa 240 

cagagaaatg cctcccgtag tgtttgaaag cgttaagctg ataatgtaat taacaactgc 300 

tgagagatca aagattcaac ttgccataca cctcaaattc ggagaaacag ttaatttggg 360 

caaatctaca gttctgtttt tgctactcta ttgtcattcc tgtttaatac tcactgtact 420 

tgtatttgag acaaataggt gatactgaat tttatactgt tttctacttt tccattaaaa 4 80 

cattggcacc tcaatgataa agaaatttaa ggtataaaat taaatgtaaa aatt 534 

<210> 330 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig49169 
<400> 330 

catacacctc aaattcggag aaacagttaa tttgggcaaa tctacagttc tgtttttgct 60 
<210> 331 
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<211> 602 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig49388 
<400> 331 

tgtcagtgga ggggtctctg cagccaactg agactatctt gctgtgccct gagccttcct 60 

agggtttaga agaacagcat tcaaaattcc ccgtcctgtc agtgtttgcc ttcgcacctc 120 

ctcccctaaa gcagcgcggg gggcaaataa gaccccaccc ctccctgcag cttcacaggg 180 

acgcttcctt ccctccccgc aaccacccca ggctcccctg ggaggctgca gttgtggtac 24 0 

acgtccccgg tgctgggttg gccgtgactc gggggcgggg cgatcgggtc tcagcccctg 300 

ccttccccag tctctgggtc acccgaattt tcccacccct gcttctcccc gaggaggttg 360 

agctcttgag caagttggga cttgggccgg ggcctggaag aatgattggc tgggaggccg 420 

cgggagggag gccaggaggc ccggaccagt tgggaggagt gagcaggccc cgggggaggg 480 

ggatgagcgc agtttgctcg ctttcctccc ctgccggccc cctccgcccc cacacacact 540 

cgggacgtct tcattgaaga ttcacttaca aaggaatgtt tcactaaata aaagaaaacc 600 
ag 602 

<210> 332 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig49388 
<400> 332 

cgggacgtct tcattgaaga ttcacttaca aaggaatgtt tcactaaata aaagaaaacc 60 

<210> 333 
<211> 562 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig50728 
<400> 333 

gcgaatttgg gccccttgat cctctgatgg gagctgaaag gatgagaggt gggcatctag 60 

atttagggag gctgttcagg ctttgcaggt cccttacctg aacacataga aaccctggag 120 

ctgtgactgt gtccatgtgt gtgtgtttgt ctgtgtgtgt tgcgggggat gggcacctgc 180 

atgaatgtgg tagagaaaat ggctctgctc agagggaaga tacgcatagc aaggcaggga 24 0 

ccagaggaat cacaggcgcc tggagagcag ccgggcaccg cctccaggga cctgccggct 300 

tccctcagtc ctccaggggc ccagcactct tcctttaggc cctgtgagcg tcccttgtca 360 

ggatacattc tctcattttg ctgaagctga tttgattggg tgtctgtttc tcgcagccaa 4 20 

aagagctctg aatgaggaaa gtgcttctgt gctaactccc cgcgtctcct gaatttcagt 480 

cattcatgta cccgcctcga aatttttgca atatctgtgt accaactgtc catttactta 540 
ataaagaagt tttctttaaa tt 562 

<210> 334 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig50728 
<400> 334 

tttgattggg tgtctgtttc tcgcagccaa aagagctctg aatgaggaaa gtgcttctgt 60 
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<210> 335 
<211> 400 



<212> DNA 

<213> Homo sapiens 



<300> 

<308> AI497657 



120 
180 



<400> 335 

tttttttttt tgcacttatg gtatttattg ttggaagatt gagtacctta atgcacacca 

atgctcagat gacttggggg cacatagggg actgctgtca ccatgcctca ctcctgcagg 

gaaggggctg ccctactaaa accccagcgg gcccagtgct gtgtccagaa caggtcctta xou 

tattactgca gcccacaatg gaactactga gtaggagcca aaagaggagg gagcaggaag 24 0 

aggtggcatt tggagagggg agaccgcacc cacaggtctg ccacagcgcg tcaacggtat 300 

ggggtacttt tacagtcaag ttgacttcgg tgtccgccca ccatctacct ttgtaggacc 360 

actgaaacaa gggacatcca ccacggccca cagccggggc 400 

<210> 336 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AI497657 
<400> 336 

gagcattggt gtgcattaag gtactcaatc ttccaacaat aaataccata agtgcaaaaa 60 

<210> 337 
<211> 475 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig50950 
<400> 337 

ctggaagagg ctcccaaccc agagtgtccc tgtgggaggc aggcagaagg tgacaattga 60 
cacgatttcc tgcacgcgtc ctcctctacc ttggaagcag ttagaatcta ccaggcacag 120 
atgaggccgc ccttgcctga cggagcttga tgagcagccc ttggtctccg gttccaggac 180 
tgagagccca gctgcctctg cccacccttc cccaggcctc tgccagcctc tggctgcacg 240 
gtcaggccct gccccatggc aggcctgcca gagcttggct ggggacccct cccgcctctg 300 
gctccctgat gggctggatg taacttgtgt cttctagccc cttaaggagc ccaggtgttt 360 
taaggaatga attggtcact gcatcttgta tcgattatgg ttctgagaaa agcaaatatc 420 
acttttggct gcattaaaag aagcatcata tataaaataa agaagatgaa ggtct 475 

<210> 338 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig50950 
<400> 338 

gtcactgcat cttgtatcga ttatggttct gagaaaagca aatatcactt ttggctgcat 60 

<210> 339 
<211> 860 



145 



WO 2006/084272 PCT/LS2006/004280 



<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig51660 
<400> 339 

ggatggcaac cttcagctag actgcctggc tcaagggtgg aagcaatacc aacagagagc 60 

atttggctgg ttccggtgtt cctcctgcca gcgaagttgg gcttccgcca agtgcagatt 120 

ctgtgccaca cgtactggga gcactggaca tcccagggtc aggtgcgtat gaggctcttt 180 

ggccaaaggt gccagaagtg ctcctggtcc caatatgaga tgcctgagtt ctcctcggat 240 

agcaccatga ggattctgag caacctggtg cagcatatac tgaagaaata ctatggaaat 300 

ggcatgagga agtctccaga aatgccagta atcctggaag tgtccctgga aggatcccat 360 

gacacagcca attgtgaggc atgcactttg ggcatatgtg gacagggctt aaaaagctac 420 

atgacaaagc cgtccaaatc cctactcccc cacctaaaga ctgggaattc ctcacctgga 480 

attggtgctg tgtacctcgc aaaccaagcc aagaaccagt cagatgaggc aaaagaggct 540 

aaggggagtg ggtatgagaa attagggccc agtcgagacc cagatccact gaacatctgt 600 

gtctttattt tgctgcttgt atttattgta gtcaaatgct ttacatcaga atgatgaaaa 660 

taggcttgcc actttctctt attttaattc catggtagtc aatgaactgg ctgccacttt 720 

aatataactg aaaattcatt ttgagaccaa gcaggatcaa gtttgtagaa taaacactgg 780 

tttcctagcc atcctctgaa aacagtatga aacatgacca agtacataat ggatttagta 840 
ataaatattg tcgaattgct 8 60 

<210> 340 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig51660 
<400> 340 

gctgcttgta tttattgtag tcaaatgctt tacatcagaa tgatgaaaat aggcttgcca 60 

<210> 341 
<211> 608 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig52490 
<400> 341 

atcgtggcta gcggacagac acgagcctct tgggaatacc ttgtccatca cgtcatggcc 60 
atgggtgcct tcttctccgg catcttttgg agcagctttg tcggtggggg tgtcttaaca 120 
ctactggtgg aagtcagcaa catcttcctc accattcgca tgatgatgaa aatcagtaat 180 
gcccaggatc atctcctcta ccgggttaac aagtatgtga acctggtcat gtactttctc 240 
ttccgcctgg cccctcaggc ctacctcacc catttettct tgcgttatgt gaaccagagg 300 
accctgggca ccttcctgct gggtatcctg ctcatgctgg acgtgatgat cataatctac 360 
ttttcccgcc tcctccgctc tgacttctgc cctgagcatg tccccaagaa gcaacacaaa 420 
gacaagttct tgactgagaa ctgagtgagg ggcacagagc ctgggacaac aaaaacggac 4 80 
aaggccagaa acagcttcat atggacactg ggacttagcc ccaagcctgg gtgtcctctg 540 
aggccagcct ctccaccttc tgagcctgcg cccacactat tgaaaacact aatgaaagta 600 
ctcctctg 608 

<210> 342 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 
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<308> Contig52490 
<400> 342 

ccaggatcat ctcctctacc gggttaacaa gtatgtgaac ctggtcatgt actttctctt 60 

<210> 343 
<211> 1282 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig53598 
<400> 343 

catgccagca cctttgaacc ggtctcttag aagaagacac acatcctggg tgtacagtgg 60 

tgaaatgggg agtgggtgcc cattctgaaa aacgaggcat tcctgctcat tccctctgot 120 

tagctggtgg gcaggggaga gagggaaatg ccaaaaactt ggagtgaagg atgatgctat 180 

tttttatttt taaatatatc ttcaggttat tttcttactg ttgcttcaga tctaatgtaa 240 

aaggcagatg tcccctcctc tccacccccg acgctgaccc cggcctcagt cacggctctt 300 

tgcatgatca cagttctgtg ttctggcctg tggcagggcc gggaagggcc gctggcttcc 360 

gaacagacgt ggttgctctc cacgaggcgc atggggagcc cgcgggccct aagctttgtc 4 20 

gcagatgtca tcattggcag aattacttgt cttgaaaaat aagtagcatt gctgaaacac 480 

acaaccgaat tctctacgat ggccatttgc tcattgtctt tcctctgtgt gtagtgagtg 540 

accctggcag tgtttgcctg ctcagagtgg cccctcagaa caacagggct ggccttggaa 600 

aaaccccaaa acaggactgt ggtgacaact ctggtcaggt gtgatttgac atgagggccg 660 

gaggcggttg ctgacggcag gactggagag gctgcgtgcc cggcactggc agcgaggctc 720 

gtgtgtcccc caggcagatc tgggcacttt cccaacccag gtttatgcgt ctccagggaa 7 80 

gcctcggtgc cagagtggtg ggcagatctg accatcccca cagaccagaa acaaggaatt 840 

tctgggatta cccagtcccc cttcaaccca gttgatgtaa ccacctcatt ttttacaaat 900 

acagaatcta ttctactcag gctatgggcc tcgtcctcac tcagttattg cgagtgttgc 960 

tgtccgcatg ctccgggccc cacgtggctc ctgtgctcta gatcatggtg actcccccgc 1020 

cctgtggttg gaatcgatgc cacggattgc aggccaaatt tcagatcgtg tttccaaaca 1080 

cccttgctgt gccctttaat gggattgaaa gcacttttac cacatggaga aatatatttt 1140 

taatttgtga tgcttttcta caaggtccac tatttctgag tttaatgtgt ttccaacact 1200 

taaggagact ctaatgaaag ctgatgaatt ttcttttctg tccaaacaag taaaataaaa 1260 
ataaaagtct atttagatgt tg 1282 

<210> 344 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig53598 
<400> 344 

ccactatttc tgagtttaat gtgtttccaa cacttaagga gactctaatg aaagctgatg 60 

<210> 345 
<211> 601 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig53641 
<400> 345 

tggaggctgt ggatgatgct ttcaagacaa tggatgtgga tatggccgag gaacatgcca 60 
gggcccagat gagggcccag atgaatatcg gggatgaagc gctgattgga cggtggagct 120 
gggatgacat acaagtcgag ctcctgacct gggatgagga cggagatttt ggcgatgcct 180 
gggccaggat cccctttgct ttctgggcca gataccatca gtacattctg aatagcaacc 240 
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gtgccaacag gagggccacg tggagagctg gcgtcagcag tggcaccaat ggaggggcca 300 

gcaccagcgt cctagatggc cccagcacca gctccaccat ccggaccaga aatgctgcca 360 

gagctggcgc cagcttcttc tcctggatcc agcaccgttg acgaactgca gcgatcttac 420 

tggccaagcc agagcgcctc ctctcagatt ccttctcgac acagcaccct aggcggcttc 480 

ttcctgtcag tcggaggtgg catgcaagat gaagctctct ttgctcttcc tgctttcatt 540 

ttgtgctttt ccttgtgttt tcatgttttg ggtatcagtg ttacattaaa gttgcaaaat 600 
t 601 

<210> 346 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig53641 
<400> 346 

ctttcatttt gtgcttttcc ttgtgttttc atgttttggg tatcagtgtt acattaaagt 60 

<210> 347 
<211> 751 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig54242 
<400> 347 

aattactcaa agaaggagcc atttcagtta actcaagtga atgaaagact tttggaatct 60 

gcagtgggtc cttccctgtt gaccatttgg taacttgtaa tctgaccaaa aaotcttgag 120, 

ctgcaacagg ccttgccaga gggctcagga tgggaaagga agaaggggat aggaaaagaa 180 

gaggtaattt tacatttccc ctttaaagta aattttagcc aactcatcat tctgaaatgt 240 

ccctataaag aatgagtcga actagaccag aagccagcct actccttctt acatagcttc 300 

tccaacaggg gtagcaatga cctgtccact tcaaacacag ataaggcctg ccatcctcat 360 

tggttaaagg cacacgtgag actttcagtg ggctctgctg agaaggaagg cagcccagga 420 

gtcaggtatg caggcattgc attgtcagtg tctgctctca gagtttacac attcaattgc 480 

ttccaagggt gaatctcctg ctctgtgaat gctatcaga'c cccaaaggcc aaccttgggc 540 

tgggtctatg tacgttcttc cgaagcactg atgatcaaaa ttgaagacac attcagaggt 600 

ttgattggtt gagattaact ggtgtggtgg ttggtgtatg tatgttttat ttttatgtct 660 

ttgtatgtag ttctacataa tgcaaattgt gctttctgat ggacaagacc tcataactgt 720 
gattaatatc aataaaaagg ggatgttgtg g 7 51 

<210> 348 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig54242 
<400> 348 

gtaaatttta gccaactcat cattctgaaa tgtccctata aagaatgagt cgaactagac 60 

<210> 349 

<211> 637 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig54661 
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ggcagtga^g tctatgttga gattaactta tgtattgagg aaaatttgaa Jtttattttt 

?cga?gaata aggctgtcaa atgatttagt atagattaat gacatctttt "agaaatat 120 

taaagtgagt attcctcatt atgtcatoat ttctgataat tagagtgcta atttgaatgt 180 

tagaLatgt ttccacatct atacctattt ctttctaggg cacttctgao cctggggctt 240 

ggggatggcc tttaggccac agtagtgtct gtgttaagtt cactaaatgt Jatttaatg 300 

lgaaaca?tc ctatgtaaaa atgtgtgtat gtgaacgtat gcatacattt "attgtgca 360 

cctgtacatt gtgaagaagt agtttggaaa tttgtaaagc acaaaccata aaagagtgtg 420 
aaattattaa atgatgtagc acaaatgtaa tgtttagctt ataaaaggtc ctttctattt . 480 

?cfatggcaa agact?tgac acttgaaaaa taaaaccaat atttgattta tttttgtaag 540 

tatttaggat attattttaa ataaatgatt gtccattatc aatataatag ttgtgaaatg 600 
atttaagtaa ataaacttta tgcttctgtg tctgttg 637 

<210> 350 
<211> 60 

<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig54661 

cJgJacatSg tgaagaagta gtttggaaat ttgtaaagca caaaccataa aagagtgtgg 60 

<210> 351 
<211> 924 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig55188 



gcgacaagta ccgcaagcgg gcactcatcc tggtgtcact gctggccttt S°°TO«*£ 
?cgccgccct cgtgctgtgg ctgtacatct accccattaa ctggccctgg atcgagcacc 
tcacc?gctt ccccttcacc agccgcttct gcgagaagta tgagctggac caggtgctgc 
actgaccgct gggccacacg gctgcccctc agccctgctg gaacagggtc tgcctgcgag 
ggctgccctc tgcagagcgc tctctgtgtg ccagagagcc agagacccaa gacagggccc 
gggc?ctgga cctgggtgcc cccctgccag gcgaggctga ctccgcgtga ^tggttggt 
Saggcgggg tttttctggg gcgtgaggcc tgtgagatcc tgacccaagc tcaggcacac 
ccaaggcacc tgcctctctg agtcttgggt ctcagttcct aatatcccgc tccttgctga 
gaccatctcc tggggcaggg tccttttctt cccaggtcct cagcgctgcc tctgctggtg 
ccttctcccc cactactact ggagcgtgcc cttgctgggg acgtggctgt ^cctcagtt 
gcccccaggg ctgggtgccc accatgcccc ttcctctttc tcctcctacc tctgccctgt 
gagcccatcc ataaggctct cagatgggac attgtgggaa aggctttggc catggtctgg 
gggcagagaa caagggggga gacacaagta gacctcaggt agaacgacac tgggcggagc 
caccccaggg cctgctccca gggagtgctc gaggcgcatc aggcccgttt tttaccagtt 
tatatcacgg tct?catttt taaaagtaac gctaactttg tacggacgat gtctcatgga 
ttaaataata ttctttatgg cagt 924 



120 
180 
240 
300 
360 



540 
600 
660 
720 
780 
840 
900' 



<210> 352 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig55188 



agJaacgcta actttgtacg gacgatgtct catggattaa ataatattct ttatggcagt 
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<210> 353 
<211> 699 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig55353 
<400> 353 

tgattatgcc aagagctcta aacagaagtt tgagaaggta aaaattaagt tgtagtatct 60 

gagttgtttt tattttcttc ctttggtgtt tatgaaggta ttcataagaa ctttaatttc 120 

aggggaaaaa atgcctgatt tgctattttt gacatttcct cgtctcttaa gaagtcagtt 180 

aaatatgttt tcatagttta tattcctgtt tcatagatta ctgtgaaaca tgtatttaaa 240 

cctatgaatt ataaaatagt atttagattc tagcgtgagt taaatagatt agtcatatat 300 

cttttagatt tgtggatttg acatgtaaat tatgtgttgt gtataagtaa gttagttact 360 

aaacatatgg catggttatt gataaacttg ttgctatttt tttccaaatg ctatcagtgt 420 

ttgtggactt ttaaaaatta gtttgaattt tggaatgttc tgtgataaaa tataatttca 480 

actattttgt acatttaaat atgccatgtt gtatatgtct gtatttaaaa atgttgtaaa 54 0 

tatctgcatt ttaagaatta tgaaagattt tcctcaaaaa tgacagaact ctccatactt 600 

aattgtgaca cattataaga tatctgattt taagcttttg gattttgttc taaaaattaa 660 
gtttaaacat gctgaaaatt ccataaaaat aaaattttg 699 

<210> 354 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig55353 
<400> 354 

taaaatagta tttagattct agcgtgagtt aaatagatta gtcatatatc ttttagattt 60 

<210> 355 
<211> 809 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig56503 
<400> 355 

gcatgtgaga tgagtgactg ccggtgaatg tgtccacagt tgagaggttg gagcaggatg 60 
agggaatcct gtcaccatca ataatcactt gtggagcgcc actctgccca agacgccacc 120 
tgggcggaca gcatggagct ctccatggcc aggctgcctg tgtgcatgtt ccctgtctgg 180 
tgcccctttg cccgcctcct gcaaacctca cagggtcccc acacaacagt gccctccaga 240 
agcagcccct cggaggcaga ggaaggaaaa tggggatggc tggggctctc tocatcctcc 300 
ttttctcctt gccttcgcat ggctggcctt cccctccaaa acctccattc ccctgctgcc 360 
agcccctttg ccatagcctg attttgggga ggaggaaggg gcgatttgag ggagaagggg 420 
agaaagctta tggctgggtc tggtttcttc ccttcccaga gggtcttact gttccagggt 4 80 
ggccccaggg caggcagggg ccacactatg cctgcgccct ggtaaaggtg acccctgcca 54 0 
tttaccagca gccctggcat gttcctgccc cacaggaata gaatggaggg agctccagaa 600 
actttccatc ccaaaggcag tctccgtggt tgaagcagac tggatttttg ctctgcccct 660 
gaccccttgt ccctctttga gggaggggag ctatgctagg actccaacct cagggactcg 720 

ggtggcctgc gctagcttct tttgatactg aaaactttta aggtgggagg gtggcaaggg 780 
atgtgcttaa taaatcaatt ccaagcctc 809 



<210> 356 
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<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig56503 
<400> 356 

gaaaactttt aaggtgggag ggtggcaagg gatgtgctta ataaatcaat tccaagcctc 60 

<210> 357 
<211> 976 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig56678 
<400> 357 

gaaggatata ctttgttata acttattatt ttgttctctg taaatacaag atgtttatag 60 

gaaatatgta ttctgaactc tatctgcaga atgagtcact acaccaaaat agttctatta 120 

tttagaatgt gttaatttta aagggacctg ataggtattt atttacatat gcgatccaca 180 

tttgtgtgaa agcatgtgat catactaacc cagcctcctg gaatgtcgct gtacgatgat 240 

tgatgtcttt ttctcagtcc atagttacaa ttgtttagta tgctaatcag tccagttccc 300 

tgaggtttaa gatcaaatat aaattactct gcttttcgac tcattcaggt agcattgtac 360 

ctgaacctga ttgctacttt ttcatcttaa atattatatt tcctcatcta atctgccttc 420 

ccctcatcca cagacatttg gagaaggaaa tgggagggtg tctgttatcc ctttctcttt 4 80 

gctttgtccc cgttgttaga ctggcagcgt cagttgctcg gtgggcttgg ttagagccgt 540 

gggtgaggca ggtggctggc ggggacaggg agaggctgag agggaagtgg tggcatttac 600 

tgctctgaca cttccactgt ccctgctggg gatgctgggg ccaaggcctg tggggcctgt 660 

gaactgcaca gccaggagca aggaacccac taaatactcc gtcacctcca tgtcccctct 720 

acagtgttaa attattacat aagcaggtga aaggtagaag gcgaattatg tgagtaaata 780 

tggtctgttt tctcttcagc aaaaatgact atttttgtgt gtgactaatt tatttttatt 840 

attgtaaaga tacaataaac cggttgaaat atctgctttg ttgacaagcg tgtgctttct 900 

ctggccttat tcgcgttctg ttctcctgca aatagcgccc tctaaaaaga agagtcagac 960 
aataaactgg ttgaaa 97 6 

<210> 358 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig56678 
<400> 358 

tattacataa gcaggtgaaa ggtagaaggc gaattatgtg agtaaatatg gtctgttttc 60 

<210> 359 
<211> 1118 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig57584 
<400> 359 

agctgttgtg catccagagg tggaattggg gcccggcatt ccctcctcgt cccgggctgg 60 
cccttgcccc caccctgcaa ctcctggttg agatgggctc agccaagagc gtcccagtca 120 
caccagcgcg gcctccgccg acaacaagca tctggctcga gtggcggacc cccgttcacc 180 
tagtgctggc atcctgcgca ctcccatcca ggtggagagc tctccacagc caggcctacc 240 
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agcaggggag caactggagg gtcttaaaca tgcccaggac tcagatcccc gctctcctac 300 
tcttggtatt gcacggacac ctatgaagac cagcagtgga gaccccccaa gcccactggt 360 

gaaacagctg agtgaagtat ttgaaactga agactctaaa tcaaatcttc ccccagagcc 420 

tgttctgccc ccagaggcac ctttatcttc tgaattggac ttgcctctgg gtacccagtt 480 

atctgttgag gaacagatgc caccttggaa ccagactgag ttcccctcca aacaggtgtt 540 

ttccaaggag gaagcaagac agcccacaga aacccctgtg gccagccaga gctccgacaa 600 

gccctcaagg gaccctgaga ctcccagatc ttcaggttct atgcgcaata gatggaaacc 660 

aaacagcagc aaggtactag ggagatcccc cctcaccatc ctgcaggatg acaactcccc 720 

tggcaccctg acactacgac agggtaagcg gccttcaccc ctaagtgaaa atgttagtga 780 

actaaaggaa ggagccattc ttggaactgg acgacttctg aaaactggag gacgagcatg 84 0 

ggagcaaggc caggaccatg acaaggaaaa tcagcacttt occttggtgg agagctaggc 900 

cctgcatggc cccagcaatg cagtcaccca gggcctggtg atatctgtgt cctctcaccc 960 

cttctttccc agggatactg aggaatggct tgttttctta gactcctcct cagctaccaa 1020 

actgggactc acagctttat tgggctttct ttgtgtcttg tgtgtttctt ttatattaaa 1080 
ggaagtaatt ttaaatgtta ctttaaaaag gtatatgt 1118 

<210> 360 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig57584 
<400> 360 

aggaatggct tgttttctta gactcctcct cagctaccaa actgggactc acagctttat 60 

<210> 361 
<211> 859 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig63649 
<400> 361 

gtcgcagggt accagtgtgc ggagttcctg ttgccaagct gaaggtggcc ctgggcaggc 60 

acaggtgtgg tcatatcttc agccaacagg accatcctcc ggagggccac ctctggggac 120 

ttcctacggg aagagagtga cagatttggt gcttctgtgt gtttctgccg cttcagtggg 180 

gccgctgcgg gagacagcgg gtggatcctc cagcagcctg tctgctgagc ctgccttctc 240 

aagtctactg ttaaaatcag gaccgggtcg tgtccgagcc tacaggccct gtctccgctc 300 

cccaggcctg caggagttga gggctgcacc tgctcgctgg agagggagag gcagatttag 360 

tggacgcctg gcatggactc ggactggcct ttggaagctc cctgccctga cgggttgcct 420 

gtcaccactg cgaagtgagg cttggaggac ctgcacctga gaaaggctgt gtgtggtctt 480 

gggtccacac ctgccagagc taacttactg ccagacggcg acttactgtg ggccaccctc 54 0 

agtgaaccgg ggtgtcctca gctggcccta cagagcactt ctgtgctggg gatgagtagg 600 

aactctgggc gaggagggtc ccagcgccgc ccctcgatac agccctgctc tgccctctgc 660 

ccgtacttat accaggtggg atccctgccc tgcattgcct ggggattggc tgggcttggg 720 

cacgccctgc tgtggaactg gatgttttca gggagcccag cctttcctca tgtcaacaca 780 

gttcacaata tagttttcaa agtacagttt aaaactcaaa agtaaacttt tcagcaactc 840 

aaaaaaaaaa aaaaaaaaa 859 

<210> 362 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig63649 
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<400> 362 

cagcctttcc tcatgtcaac acagttcaca atatagtttt caaagtacag tttaaaactc 60 

<210> 363 
<2H> 1170 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig63525 
<400> 363 

gccatggctc cctgggcgga gcgagcactc gcggctgaac ccgctgcgcg cggtgtggct 
cacgctgacc gccgccttcc tgctgaccct actgctgcag ctcctgccgc ccggcctgct 
cccgggctgc gcgatcttcc aggacctgat ccgctatggg aaaaccaagt gtggggagcc 
gtcgcgcccc gccgcctgcc gagcctttga tgtccccaag agatattttt cccactttta 
tatcatctca gtgctgtgga atggcttcct gctttggtgc cttactcaat ctctgttcct 
gggagcacct tttccaagct ggcttcatgg tttgctcaga attctcgggg cggcacagtt 
ccagggaggg gagctggcac tgtctgcatt cttagtgcta gtatttctgt ggctgcacag 
cttacgaaga ctcttcgagt gcctctacgt cagtgtcttc tccaatgtca tgattcacgt 
cgtgcagtac tgttttggac ttgtctatta tgtccttgtt ggcctaactg tgctgagcca 
agtgccaatg gatggcagga atgctacata acagggaaaa atctattgat gcaagcacgg 
tggttccata ttcttgggat gatgatgttc atctggtcat ctgcccatca gtataagtgc 
catgttattc tcggcaatct caggaaaaat aaagcaggag tggtcattca ctgtaaccac 
aggatcccat ttggagactg gtttgaatat gtttcttccc ctaactactt agcagagctg 
atgatctacg tttccatggc cgtcaccttt gggttccaca acttaacttg gtggctagtg 
gtgacaaatg tcttctttaa tcaggccctg tctgcctttc tcagccacca attctacaaa 
agcaaatttg tctcttaccc gaagcatagg aaagctttcc taccattttt gttttaagtt 
aacctcagtc atgaagaatg caaaccaggt gatggtttca atgcctaagg acagtgaagt 
ctggagccca aagtacagtt tcagcaaagc tgtttgaaac tctccattcc atttctatac 
cccacaagtt ttcactgaat gagcatgcag tgccactcaa gaaaatgaat ctccaaagta 
tcttcaaaga attaattact aatggcagat 1170 

<210> 364 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig63525 
<400> 364 

ctcttacccg aagcatagga aagctttcct accatttttg ttttaagtta acctcagtca 60 

<210> 365 
<211> 632 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig64688 
<400> 365 

aagaatgcta agatgatttc agatatcgaa aagaaaaggc agcgtatgat tgaagtccag 
gatgaactgc ttcggttaga gccacagctg aaacaactac aaacaaaata tgatgaactt 
aaagagagaa agtcttccct taggaatgca gcatatttct tatctaattt aaaacagctt 
tatcaagatt attcagatgt tcaagctcaa gaaccaaacg taaaggaaac gtatgattca 
tccagccttc cagctctgtt atttaaagca agaacacttc tgggagccga aagccatctg 
cgaaatatca accatcagtt agagaagctc cttgaccagg gatgagaaga gcagtctact 
aaaatgtgcc tataggaaga ctagtctcat gctgttacct tctgaaactg tacctttata 
aatcaattgt tttgcaaaga agttatggcc tacttagaat ctaaaatttg ttattcaaat 
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taaatggctg tgaacaatgt taaatagcat cagtttgtcc aatagtttta aaggccataa 540 
tcatcttttc tggttaatat cttgagtaat tttaaaatgt tgacacctta atcggtccca 600 
ggtatgagcc ataataaact tgtaaaatta ag 632 

<210> 366 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig64688 
<400> 366 

ggctgtgaac aatgttaaat agcatcagtt tgtccaatag ttttaaaggc cataatcatc 60 



<210> 367 
<211> 2596 
<212> DNA 

<213> Homo Sapiens . 
<300> 

<308> AL080059 
<400> 367 

gatgtcacct gctttatttc tggctttggc ctgtggtctg tgatacccat cctgcttgat 60 
gttctgcaga atggcacttg actgctgggc atgcatgaag ttaagggcaa gaaacagtat 120 
gccatgtgtt ctgtaccatc atgtgtctct tottgcttct gggcccttct actggtgaac 180 
tttcatcaag atctgcgcca tgccgtgtca ctatcaagcc attaagtttt gtctgggttg 240 
ctgtcagccc cagttggctt cctggtcaac aaggacctca agaactgcct gtggaccgag 300 
gccccctacc agtgcatgag acacacacct accctcccca gctttccagg aaccctactg 360 
gctgccagac tgatgggcgg gctggtatgt gtggacatgt gttcactgtc attatgctgt 420 
ggctccaggt gagggtgagg actgggccta tatagaatcc agataccatt gtcaacttcc 480 
ottattcccg tctaagatgt gagcagagtg ccatagtagg ggttctggga agaggtattt 540 
ctgatttgtg ggcctctgct tgcttgactt caggtcactt atacttctta ttttgcttgc 600 
ctgccttcat ccctcatttc ctccctctca ttcttctttc ctccctccct ttcctggtag 660 
cctcctttcc tccccttctg ccttcccctt ccttctttcc ttattctttt ttattttgtt 720 
taaatagtac cacagagaaa acaactgaaa aaccacattt ttctacatac agctggggag 7 80 
gtagctgaga acttggcact gcgcacacat actaggttga aagagagttg aggaaaccag 840 
aaggccaagt ggatctgctg gcaaaccctg aacctgtctc ctgcgcttgc tctacagttc 900 
tgaagttgaa aatcgttttc atgcctagca tctgcttgag ttataaaccc caaggcagcc 960 
atgtcataga ctagtgttta ctcttgtttt gactttgttt taatgcttcc taagacccaa 1020 
gtgcctcctg ctgtttcctc ctttgtggta gcctctggcc atctggacct caatccccag 1080 
ctttcccact ttcagcagtc ctttgctctc tttgcttcta cctcaaatag ccccaggagt 1140 
gggctttagt ctccaatatg gagcatctca agcttctcct gggggatggg gattgggatg 1200 
ggcggaatct gttttggatc tccgggttat ttccagtggg tgtaaaagca gagctgggcc 1260 
tttccctctc ttatccctga gggtgggtaa gaaggactgt atctacacct gttcttccct 1320 
accttctctt ttgttaggga ggcctcattc taagttcctc aagagagtcc ttggcttaaa 1380 
gctgtagcaa gggtgtgcta ggtgggggat ttggagcaaa accgtcgagt aggcatgata 14 4 0 
ctggtatgga gtgggcctgc aaaatcagac agaaatggct tgagaagccg cagggggagc 1500 
atgcctgtct ctcagtgata gagtatggga gggacctccc tagottggaa aatgagaatt 1560 
gaaggggtta tgaacaaata ggatgcctag ttgaggatgt tcccaaagtt ttgtccaatc 1620 
ttatcattag tagattttat aagccacaga gacaaaccag aaacggaata atgttacttt 1680 
ggatgcttta tttttttgtt ctaggtgtgg ctttgtacat gcagaagaat gctatatgct 1740 
gcacattttg cctttaaagt cttacgactt. tccccatttt agtctaatgg gaagatacag 1800 
atgtgcaagt ctgctttttt gttttttgtt attatttttt tttttgctct gtgttatgga 1860 
cattttcaga catgcacaga agtggagagg atggtccttg gaccccatgt gtccatcacc 1920 
tagctgcatc acttatcagc tatggtcaac ctggtttcat ctgtatctct ctcttttcac 1980 
ctgtattgtt tattgaaaat ccaagacact atgccaatgc aaccgtgact actttgggag 2040 
attggtagtc tcttttgatg gtgatagtga tggggtgcac tatcataatc acatcaggtc 2100 
tgctttttgc ttttaatgtt aactaatgaa gttccagaga tgggccttag aaatgtgttt 2160 
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taagaattaa caaggagtct caaaaagaaa 
acaaaacaag agagagactg ttctgttgta 
gtacttgaga cccttcacca gaatgtcaat 
tgaccattag cattgttatc agcttgtact 
tttggaaatt gttgctgtgt tctgtgaaaa 
gttctacttg gtaatttgac accctgttaa 
tataaatagt tgtaagtttg catgcatgat 
aaaaaaaaaa aaaaaa 

<210> 368 
<211> 443 
<212> DNA 

<213> Homo Sapiens 



tgagagggat gcttcctttc ccttgcat.ct 2220 
aaactctttc aaaaattctg atatggtaag 2280 
ctttttttct gtgtaacatg gaaacttgtg 2340 
ggtctcataa ctctggtttt ggaagaataa 2400 
taacctcccc aaaataatta gtaactggtt 2460 
taacgcaatt atttctgtgt tcttaaacag 2520 
ggaaaaataa aaacctgtat ctctgtcaaa 2580 
2596 



<300> 

<308> AA555029 



<400> 368 






tttttttttt 


tttttccatt 


aacccaacac 


ggagtatttg 


tttctttggt 


agttggtaga 


attaagcatt 


cgatcatgtt 


acaaaacaat 


aaatctgcct 


gaagcaaaat 


gtgataaact 


cataaagttg 


attctcaaac 


acgggtttga 


gataaatata 


ttgaaaattt 


ttttggaaat 


ccatgtagct 


tagaaatact 


gaaaaactaa 


tatgtagaca 


ctagtctatt 


tta 


<210> 369 






<211> 3160 






<212> DNA 






<213> Homo 


sapiens 




<300> 






<308> NM_00374 8 





aagtttattt actaagtcaa gtcaaactga 60 
caaagaatac atatacatat tctatttccc 120 
ggcctagaga actatcattg aagatttacc 180 
gcagaaatgg tggagaatga cactggaagt 240 
actgcatagg accgcttata tgcatatttt 300 
ttgcaacaat ttaaaaaact tgcagatgca 360 
gaaaaaggtg tgtcatgaat tcatataata 420 
443 



<400> 369 

attctcgccg cgcccgggcg gacgatccag 
gctgccggcg cccgcgctcc gccgcgccct 
gcggtggaag cacacctcct ccctgaaggt 
gggcagccct gagcgagatg ccctgcaaaa 
agccatccca tgcgtggtgg gggatgagga 
gtcgcctttt aaccatggac ataaggtggc 
caacaaagcc attgaggctg ccctggctgc 
agaccgggcc cagatcttcc tgaaggcggc 
gatcctcgcc aagaccatgg tgggacaggg 
tgcagcggaa ctcatcgact tcttccggtt 
gcagcagccc atcagcgtgc ccccgagcac 
cttcgtggcg gccatctcgc cctttaactt 
accggccctg atgggcaacg tggtcctatg 
ctatgctgtc taccgcatcc ttcgggaggc 
gccagctgat gggcccctat ttggggacac 
caacttcaca ggcagtgtgc ccaccttcaa 
ggaccggttc cacaccttcc cacgcctggc 
cgtgcaccgc tcggccgacg tggagagcgt 
gtacggtggc cagaagtgtt ccgcctgctc 
gcagatcaaa gggcggctgc tggaggagca 
ggattttggg accttcttct ctgcagtgat 
gtggctggag cacgcgcgct cctcgcccag 
tgactccgtg ggctactttg tggagccctg 
catcatgaag gaggagatct tcgggcctgt 
gtacaaggag acgctgcagc tggttgacag 



cgaacagccc cgcttctaac ccgagatgct 60 
gctgtcccgc ccctggaccg gggccggcct 120 
ggccaacgag cccgtcttag ccttcacgca 180 
ggccttgaag gacctgaagg gccggatgga 240 
ggtgtggacg tcggacgtgc agtaccaagt 300 
caagttctgt tatgcagaca agagcctgct 360 
ccggaaagag tgggacctga agcctattgc 420 
agacatgctg agtgggccgc gcagggctga 480 
taagaccgtg atccaagcgg agattgacgc 540 
caatgccaag tatgcggtgg agctggaggg 600 
caacagcacg gtgtaccggg gtctggaggg 660 
cactgcaatc ggcggcaacc tggcgggggc 720 
gaagcccagt gacactgcca tgctggccag 780 
tggcctgccc cccaacatca tccagtttgt 840 
tgtcaccagc tcagagcacc tctgtggcat 900 
acacctgtgg aagcaggtgg cccagaacct 960 
tggagagtgc ggcggaaaga acttccactt 1020 
ggtgagcggg accctccgct cagccttcga 1080 
gcgtctctac gtgccgcact cgctgtggcc 1140 
cagtcggatc aaagtgggcg accctgcaga 1200 
tgatgccaag tcctttgccc gtatcaagaa 1260 
cctcaccatc ctggctgggg gcaagtgtga 1320 
catcgtggag agcaaggacc ctcaggagcc 1380 
actgtctgtg tacgtctacc cggacgacaa 1440 
caccaccagc tatggcctca cgggggcagt 1500 
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gttctcccag gataaggacg tcgtgcagga ggccacaaag gtgctgagga atgctgccgg 1560 

caacttctac atcaacgaca agtccactgg ctcgatagtg ggccagcagc cctttggggg 1620 

ggcccgagcc tctggaacca atgacaagcc agggggccca cactacatcc tgcgctggac 1680 

gtcgccgcag gtcatcaagg agacacataa gcccctgggg gactggagct acgcgtacat 1740 

gcagtgagcc cctctcgggc tccaccgtcc agctgtctgt ccgtccaggt ggccgacctc 1800 

actgcacaga ccccactcca gcccctccac cccttcttca tgcacagctg cctttctata 1860 

atccgggctt gactcccttc ttaccactgt attctggcct ctcccatgcc tcaggctctg 1920 

gtttgagatc gtgctgggga ggaacatggc cactacccct tatcccatcg gccatgtggg 1980 

aggtatgacc ctggtgcctg gcaggttctc cctctgccct ccactgggcc cagtggctca 2040 

gggacctggg gaaaggagat ggagcagctc ttgggatcct ttggggaaaa ggaggccatt 2100 

ctgggcccct bggcaaacct caccactcac agaggctcct ggccttgatc cctgcccctc 2160 

caggtgtcca gggtaaagtg taactcagac tgacctgtgg ggcacagggg gcaccagctg 2220 

gccttgccct ctctggtctg ggctgtctac cttcctcact gtatctttgc ccagacccac 2280 

ctgggccagt aggcccctgt ccccagccac acaccttaga tgctggcatg ccttactcca 2340 

ggtgcctgtg tttggccgag gcctgtgtga ttcccggtct gcaccacatg gcggggttgg 2400 

ggggccgctg gaggccacct gccaaggcgt gggatgggat ggtcctgccg gtttaggccg 24 60 

tgattctgga aaaccttgga tgggccttcg tcctatgtca gccttccctt tgatcctcag 2520 

gccctacctg tagagacctc cactcctaga gccagtctca gggtctggga tttccctgca 2580 
ggagctcagc caccactgtg ccatggtgac acaggccaag gcagacattg gccctccctt 2 640 
ctcccagccc ccagaggcct ggccttgggt tcgtcagcat gggccgagga cgttgcctgt 2700 
agaatcctcc tctgcctggg agtggctctg tgtggaccag tccctcactg gcccattctt 2760 
tttttgacgc agccaatctg tgaccacgat tcctcccaca gatgcctcct gcttggattc 2820 
tgagtggtca gagatctgta aagcatgact ttcaaggatg gttcttaggg gactgtgaaa 2880 
gtgttgggtc ttcctccagg atgcctgcat gggaccccac ccggagctgg tgtggccatt 2940 
ccccaagtgc cactggccca tggatggggg tgggtgctgg tgccagctgg gctgggtgtg 3000 
ggttctgtgt ccttccagga tatgtgtcat ttcccatgag gggccggggc aggtggctgg 3060 
gtgggggcac aggctggagt attcttagtt ctactggttc tacactgtga ggtggcaatg 3120 
ggatttgctc agatgccacc caataaaatg cctgttactt 3160 

<210> 370 
<211> 1546 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_003862 
<400> 370 

cacggccgga gagacgcgga ggaggagaca tgagccggcg ggcgcccaga cggagcggcc 60 
gtgacgcttt cgcgctgcag ccgcgcgccc cgaccccgga gcgctgaccc ctggccccac 120 
gcagctccgc gcccgggccg gagagcgcaa ctcggcttcc agacccgccg cgcatgctgt 180 
ccccggactg agccgggcag ccagcctccc acggacgccc ggacggccgg ccggccagca 240 
gtgagcgagc ttccccgcac cggccaggcg cctcctgcac agcggctgcc gccccgcagc 300 
ccctgcgcca gcccggaggg cgcagcgctc gggaggagcc gcgcggggcg ctgatgccgc 360 
agggcgcgcc gcggagcgcc ccggagcagc agagtctgca gcagcagcag ccggcgagga 420 
gggagcagca gcagcggcgg cggcggcggc ggcggcggcg gaggcgcccg gtcccggccg 480 
cgcggagcgg acatgtgcag gctgggctag gagccgccgc ctccctcccg cccagcgatg 540 
tattcagcgc cctccgcctg cacttgcctg tgtttacact tcctgctgct gtgcttccag 600 
gtacaggtgc tggttgccga ggagaacgtg gacttccgca tccacgtgga gaaccagacg 660 
cgggctcggg acgatgtgag ccgtaagcag ctgcggctgt accagctcta cagccggacc 720 
agtgggaaac acatccaggt cctgggccgc aggatcagtg cccgcggcga ggatggggac 780 
aagtatgccc agctcctagt ggagacagac accttcggta gtcaagtccg gatcaagggc 840 
aaggagacgg aattctacct gtgcatgaac cgcaaaggca agctcgtggg gaagcccgat 900 
ggcaccagca aggagtgtgt gttcatcgag aaggttctgg agaacaacta cacggccctg 960 
atgtcggcta agtactccgg ctggtacgtg ggcttcacca agaaggggcg gccgcggaag 1020 
ggccccaaga cccgggagaa ccagcaggac gtgcatttca tgaagcgcta ccccaagggg 1080 
cagccggagc ttcagaagcc cttcaagtac acgacggtga ccaagaggtc ccgtcggatc 1140 
cggcccacac accctgccta ggccaccccg ccgcggcccc tcaggtcgcc ctggccacac 1200 
tcacactccc agaaaactgc atcagaggaa tatttttaca tgaaaaataa ggaagaagct 12 60 
ctatttttgt acattgtgtt taaaagaaga caaaaactga accaaaactc ttggggggag 1320 
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gggtgataag gattttattg ttgacttgaa acccccgatg acaaaagact cacgcaaagg 1380 
gactgtagtc aacccacagg tgcttgtctc tctctaggaa cagacaactc taaactcgtc 14 40 
cccagaggag gacttgaatg aggaaaccaa cactttgaga aaccaaagtc ctttttccca 1500 
aaggttctga aaggaaaaaa aaaaaaaaac aaaaaaaaaa aaaaaa 154 6 

<210> 371 
<211> 376 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig32125_RC 
<400> 371 

caatttgtat ttttctcaat ataatgttgc aagatgtcat tatattctca ttacccataa 60 
cttcccatcc aaatcttaat ggcacatttg gtattttttc aaatggcttt gagatataat 120 
tcacatacca cacaattcat gtatctaaag tatacagttc tgtagttctg tggttgttgg 180 
ttacatccac cagactgagg gctccctgag ggttggtgcc tcagctttcc cttcacttac 240 
tcattcacca aacattcagc gcctactgag tactggggtt atgatggcca acaggagaga 300 
cagtctctgc ctactgtttg gtgggggtgg agtacttagt aaccgtcgtc attattgagt 360 
gcttacctgt gctggg 376 

<210> 372 
<211> 1639 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> U82987 
<400> 372 

ccgccccctc cggcgtgttc atgcccccgg ggccccaggg agcgccatgg cccgcgcacg 60 
ccaggagggc agctccccgg agcccgtaga gggcctggcc cgcgacggcc cgcgcccctt 120 
cccgctcggc cgcctggtgc cctcggcagt gtcctgcggc ctctgcgagc ccggcctggc 180 
tgccgccccc gccgccccca ccctgctgcc cgctgcctac ctctgcgccc ccaccgcccc 24 0 
acccgccgtc accgccgccc tggggggttc ccgctggcct gggggtcccc gcagccggcc 300 
ccgaggcccg cgcccggacg gtcctcagcc ctcgctctcg ctggcggagc agcacctgga 360 
gtcgcccgtg cccagcgccc cgggggctct ggcgggcggt cccacccagg cggccccggg 4 20 
agtccgcggg gaggaggaac agtgggcccg ggagatcggg gcccagctgc ggcggatggc 4 80 
ggacgacctc aacgcacagt acgagcggcg gagacaagag gagcagcagc ggcaccgccc 54 0 
ctcaccctgg agggtcctgt acaatctcat catgggactc ctgcccttac ccaggggcca 600 
cagagccccc gagatggagc ccaattaggt gcctgcaccc gcccggtgga cgtcagggac 660 
tcggggggca ggcccctccc acctcctgac accctggcca gcgcggggga ctttctctgc 720 
accatgtagc atactggact cccagccctg cctgtcccgg gggcgggccg gggcagccac 780 
tccagcccca gcccagcctg gggtgcactg actgagatgc ggactcctgg gtccctggcc 8 40 
aagaagccag gagagggacg gctgatggac tcagcatcgg aaggtggcgg tgaccgaggg 900 
ggtggggact gagccgcccg cctctgccgc ccaccaccat ctcaggaaag gctgttgtgc 960 
tggtgcccgt tccagctgca ggggtgacac tggggggggg gggggctctc ctctcggtgc 1020 
tccttcactc tgggcctggc ctcaggcccc tggtgcttcc ccccctcctc ctgggagggg 1080 
gcccgtgaag agcaaatgag ccaaacgtga ccactagcct cctggagcca gagagtgggg 1140 
ctcgtttgcc ggttgctcca gcccggcgcc cagccatctt ccctgagcca gccggcgggt 1200 
ggtgggcatg cctgcctcac cttcatcagg gggtggccag gaggggccca gactgtgaat 12 60 
cctgtgctct gcccgtgacc gccccccgcc ccatcaatcc cattgcatag gtttagagag 1320 
agcacgtgtg accactggca ttcatttggg gggtgggaga ttttggctga agccgcccca 1380 
gccttagtcc ccagggccaa gcgctggggg gaagacgggg agtcagggag ggggggaaat 1440 
ctcggaagag ggaggagtct gggagtgggg agggatggcc cagcctgtaa gatactgtat 1500 
atgcgctgct gtagataccg gaatgaattt tctgtacatg tttggttaat tttttttgta 1560 
catgattttt gtatgtttcc ttttcaataa aatcagattg gaacagtgga aaaaaaggcc 1620 
ttcgtggcct cgagagatc 1639 



<210> 373 
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<211> 2273 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> AL137718 
<400> 373 

gggggccagt taccatattg tccggagtgg gtccttgaca ttcaggaaag atacatcctc 60 
agacctttca aaatttgtca tgcctctcta acctaaaaca gcqctcatta aatgcacttt 120 
aatccgagca ctgtatggct tggaaagaat tatgagtgag gagaggagcc tttccttatt 180 
ggccaaagcc gtggatccca gacaccccaa tatgatgaca gatgtggtta aacttctctc 240 
tgcggtatgc attgtagggg aagaaagcat ccttgaagaa gttttagaag ctttaacttc 300 
agctggtgaa gaaaaaaaaa ttgacagatt tttttgtatt gtggaaggcc tccggcacaa 360 
ttcagttcaa ctgcaagtag cttgtatgca gctcatcaat gccctggtta catctcctga 420 
tgatttggat ttcaggcttc acatcagaaa tgaatttatg cgttgtggat tgaaagagat 480 
attgccaaat ttaaaatgca ttaagaatga tggcctggat atccaactta aagtctttga 540 
tgagcataaa gaagaagatt tgtttgagtt atcccatcgc cttgaagata ttagagctga 600 
acttgatgaa gcatatgatg tttacaacat ggtgtggagc acagttaaag aaactagagc 660 
agagggatat tttatttcta ttcttcagca tcttttgctg attcgaaatg attattttat 720 
aaggcaacaa tacttcaaat taattgatga gtgtgtatcc cagattgtat tgcatagaga 780 
tggaatggat ccagacttca catatcgaaa aagactagat ttagatttaa cccagtttgt 840 
agacatttgc atagatcaag caaaactaga agagtttgaa gagaaagcat cagaacttta 900 
caagaaattt gaaaaagagt ttaccgacca ccaagaaact caggctgaat tgcagaaaaa 960 
agaggcaaag attaatgagc ttcaagcaga gctacaagct tttaagtctc agtttggtgc 1020 
cttgccagct gattgtaata ttcctttgcc tccctctaaa gaaggtggaa ctggccactc 1080 
agcacttcct cctccgcctc cactgccttc tggtggaggg gtgccgcctc cacctcctcc 1140 
cccaccacct cctccacttc caggaatgcg gatgccattc agtggtcctg tgcctccacc 1200 
acctcccctg ggattccttg gaggacaaaa ttctcctcct ctaccaatcc tgccatttgg 12 60 
gttgaaacca aagaaagaat ttaaacctga aatcagcatg agaagattga attggttaaa 1320 
gatcagacct catgaaatga ctgaaaactg tttctggata aaagtaaatg aaaataagta 1380 
tgaaaacgtg gat.ttgcttt gtaaacttga gaatacattt tgttgccaac aaaaagagag 1440 
aagagaagag gaagatattg aagagaagaa atcgattaag aaaaaaatta aagaacttaa 1500 
gtttttagat tctaaaattg cccagaacct ttcaatcttc ctgagctctt ttcgggtgcc 1560 
atatgaggaa atcagaatga tgatattgga agtagatgaa acacggttgg cagagtctat 1620 
gattcagaac ttaataaagc atcttcctga tcaagagcaa ttaaattcat tgtctcagtt 1680 
caagagtgaa tatagcaact tatgtgaacc tgagcagttt gtggttgtga tgagcaatgt 1740 
gaagagacta cggccacggc tcagtgctat tctctttaag cttcagtttg aagagcaggt 1800 
gaacaacatc aaacctgaca tcatggctgt cagtactgcc tgcgaagaga taaagaagag 1860 
caaaagcttt agcaagttgc tggaacttgt attgctaatg ggaaactaca tgaatgctgg 1920 
ctcccggaat gctcaaacct tcggatttaa ccttagctct ctctgtaaac taaaggacac 1980 
aaaatcagca gatcagaaaa caacgctact tcatttcctg gtagaaatat gtgaagagaa 2040 
gtaccctgat atactgaatt ttgtggatga tttggaacct ttagacaaag ctagtaaagg 2100 
tttgtgcctt tttaagaaac atttcatggc tctgatcttc agtgctaaaa gactgaaaat 2160 
tatacctttt atttgtatgt actttccact ttctcacagt gttttcatac ccaatatctc 2220 
attttaaaaa taaagagagt tgtaggatat taaaaaaaaa aaaaaaaaaa aag 2273 

<210> 374 

<211> 2782 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AB037863 
<400> 374 

gcggccgcgg cggggcggcg 
gctgctgctc caggtgctgc 
gggcgctggc ggcggcggcg 
gcggctgctg ctgctcctgc 
gcccgcagcc tccgcctccc 



cgggaaccgg gccccggggg gagtcggccg ggctgctgct 60 
cgcccggggc tacggcaggg ccgggacgcc ggggcacgcg 120 
tctgctggcc cggcgcggcc ccagcctttc cccgggacgc 180 
cgccgctgcc gccactgtcg ccgccgccgc cgagctccgc 24 0 
ggatggacgc tctgccccgc agcgggctga acctgaagga 300 
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ggagccgctg ctgcccgccg 

cctggacgcc agcaccgcgg 

gcagcctccc tccaacctca 

ccggcagggg cagcccgtgg 

ccgagagccc ggggcggaaa 

taacaatgga ctgcggacag 

acaggccatc atctatgagg 

ccatgagatc atgtgcagcc 

gccctcagac cccgtcatca 

gaactgcctg aagaatgcgg 

cacgacggtg agcgtggacg 

caactccaag catggccgca 

catcaaggcc atcagccccg 

cggcgacaac ttcttcgacg 

gctcatcacg ccccacgcca 

ggaggtgacc ctctcctaca 

ctacacagct ctgaacgagc 

tcccagacac cccggagacc 

cgacttggca gaagccctgt 

cgcggcggac gtggccgagg 

cctggccccg agccacccac 

ggccatcgcc gtcggggacg 

cagcgcgtcc ccccgcgggt 

cggcctcgga gctggcctgg 

tgggtccccc agcttcctca 

tcgcccccgt gctgcgcccc 

aggggcttcc agaccagtct 

ttcagggcct ggcatactcc 

aggtccctct gggattcaca 

ggccatgggc ggacctcaac 

gcctccccgc tagcagcccc 

ccccagcact gtgcagatgc 

ggcctcactg tctgggtctt 

gttgagtgag gggagcgtgg 

acctggaaag gtcctgtttg 

tgcagcgaaa gggtggcgcc 

ccaacagctg cttggccccg 

tgccgcatca tgcctccctg 

aggcctcctg tatttggggt 

ctcacaatga ctctggctgg 

attctgctct gctaagtttg 

ccggcctctc acctgcctcc 

<210> 375 
<211> 716 
<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_020188 
<400> 375 

acggtggagc ggtggagggc 

gctccctagc agccggcctt 

ccggtgtggc cacgagtcgg 

tcctgactta tctccacact 

ggaatgtcac aaaaatcaca 

ggagttgaga aaatgcctga 

tggcattgca atgcgaaaga 

attttcactc gatgccttgg 

aatcctgtct tatttggtct 



gcctgggctc agtgcgctcc 
cgcagagtgg cgtgggtctg 
ggaaatccaa cttcttccac 
aggtggagcg cacagccttc 
agactaacaa tgggatccat 
agcaagacct ctacgtgcgt 
ggcaggacaa gaaccccgaa 
ggtgctgtga ccggaagagc 
ttgacaggtt cttcctcaag 
ggaatcccag agacatgcgc 
gacacgtgct ggccgtgtcc 
gggcgcgccg cctggacccc 
gggagggctg gaccacgggc 
ggttgcaggt cgtgttcgga 
tccgggtgca gacgcccccg 
agtccaagca gttttgcaag 
ccaccattga ctacggattc 
ccgagaggct gcccaaggaa 
acggagtgcc cggcagtaac 
ctctgtacag caccccccgc 
actccgccgt cgtgggcatc 
ccaccccggg gcccgagccg 
tcgcgcccag ccccggctcg 
gcggctacgg cgcgccgggc 
atggctccac cgccacctcg 
ccaagctccc caccccaggc 
tttgaggatt ctgacaagtt 
taattacggt ctgcagctgt 
gccacacccc ggatggtggc 
ccgtgagctg aacggggaga 
acaggcttct ctcgcctccc 
tcttggcagg acagcatcgc 
ggttcctctg aaagagatgg 
gatggggacc gtgggaaaga 
catctgaccc atctcaactg 
ccgcagcctc gggaggcctg 
cagctgcccc cacttccttt 
tgggggcttt gggcatggag 
cttcccccag tggatgtctc 
ccccacgcag cgggcccagc 
gagaaaacag aacaataaac 
tt 



tggatgcagg gcgcgggcat 360 
gcacgagcac attttgagaa 420 
ttcgtgctgg ccatgtacga 480 
atcgacttcg tggaaaagga 540 
taccgcctcc ggctggtgta 600 
ctcatcgact ccatgtccaa 660 
atgtgccgag tgctgctcac 720 
tgtggcaacc ggaatgagac 780 
ttcttcctca aatgcaacca 84 0 
cgcttccagg tggtggtgtc 900 
gacaacatgt ttgtgcacaa 960 
tccgaagctg ccaccccctg 1020 
ggcgccaccg tcattgtcat 1080 
aacgtgctcg tgtggagcga 1140 
cggcacatcc ccggggtggt 1200 
ggatgccccg gccgctttgt 1260 
cagaggctac agaaagtcat 1320 
gtgctgctga agcgggcggc 1380 
caggagctgc tcctgaagcg 1440 
gcacccgggc cgctcgcacc 1500 
aacgccttca gcagcccgct 1560 
ggctacgcgc gcagctgcag 1620 
cagcagagcg gctacggcgg 1680 
gtggccggcc tcggcgtgcc 1740 
cccttcgcca aggagcgcct 1800 
ctgccccaga gcccacggag 18 60 
ccactctcca gcccgggggc 1920 
tcccatggag cccggactgg 1980 
acagacagat gcagggccag 2040 
ggccttcacc ccatgctcaa 2100 
tgtcttgggg tagtcagaag 2160 
agggaggtgc tgggattctg 2220 
atcttgtgca gaccagggtt 2280 
ggacagctca gggagaagtg 2340 
gcccagcatc ccaacttctc 2400 
cccaggctcc cgtggagctt 24 60 
gagacctgca ctctcatgct 2520 
gaggcagaag agggggtgcc 2580 
atggactctg gccccacaca 2640 
cgccccccag gtggcctcac 2700 
cagatgcagg tggtgcccgc 27 60 
2782 



gtcactgggt ttcggcgtct 
cgggtcgggc gtctccgcgg 
gttgcactgc tgtgatccat 
tgcacactga agaatgcaac 
acattctgaa attttttggt 
agaatgagta cgtagaaaac 
aactttttaa tcctccagag 
ctgagagaag acctaaagac 
ccataatcct ttgaatggaa 



ggcaagcggt tcagctgtct 60 
ctactgccgc ttcagttctc 120 
cctcatctcc taaagatgca 180 
gtcttgatta acttgcttaa 240 
tattgtaatg atgttgatcg 300 
aggaccaaga gcagggagca 360 
gaatccgaaa aataaattgt 420 
tctgggttga tacctgaaag 480 
agtgacctgt gagagattga 540 
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accatggaga aatatgaaaa ccctggattc tgagtatttg ttgggcaggg cgtttagtac 600 
tgtctcccct ttaccagcaa acctgacttc accatgttta ttccctttgc ctacaaccag 660 
ttaatatctg agtaacttat ctccttcaat aaaataattt aaataaaaaa aaaaaa 716 

<210> 376 
<211> 3737 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_020974 
<400> 376 

ggcgtccgcg cacacctccc cgcgccgccg ccgccaccgc ccgcactccg ccgcctctgc 60 
ccgcaaccgc tgagccatcc atgggggtcg cgggccgcaa ccgtcccggg gcggcctggg 120 
cggtgctgct gctgctgctg ctgctgccgc cactgctgct gctggcgggg gccgtcccgc 180 
cgggtcgggg ccgtgccgcg gggccgcagg aggatgtaga tgagtgtgcc caagggctag 240 
atgactgcca tgccgacgcc ctgtgtcaga acacacccac ctcctacaag tgctcctgca 300 
agcctggcta ccaaggggaa ggcaggcagt gtgaggacat cgatgaatgt ggaaatgagc 360 
tcaatggagg ctgtgtccat gactgtttga atattccagg caattatcgt tgcacttgtt 420 
ttgatggctt catgttggct catgacggtc ataattgtct tgatgtggac gagtgcctgg 480 
agaacaatgg cggctgccag catacctgtg tcaacgtcat ggggagctat gagtgctgct 540 
gcaaggaggg gtttttcctg agtgacaatc agcacacctg cattcaccgc tcggaagagg 600 
gcctgagctg catgaataag gatcacggct gtagtcacat ctgcaaggag gccccaaggg 660 
gcagcgtcgc ctgtgagtgc aggcctggtt ttgagctggc caagaaccag agagactgca 720 
tottgacctg taaccatggg aacggtgggt gccagcactc ctgtgacgat acagccgatg 780 
gcccagagtg cagctgccat ccacagtaca agatgcacac agatgggagg agctgccttg 840 
agcgagagga cactgtcctg gaggtgacag agagcaacac cacatcagtg gtggatgggg 900 
ataaacgggt gaaacggcgg ctgctcatgg aaacgtgtgc tgtcaacaat ggaggctgtg 960 
accgcacctg taaggatact tcgacaggtg tccactgcag ttgtcctgtt ggattcactc 1020 
tccagttgga tgggaagaca tgtaaagata ttgatgagtg ccagacccgc aatggaggtt 1080 
gtgatcabtt ctgcaaaaac atcgtgggca gttttgactg cggctgcaag aaaggattta 114 0 
aattattaac agatgagaag tcttgccaag atgtggatga gtgctctttg gataggacct 1200 
gtgaccacag ctgcatcaac caccctggca catttgcttg tgcttgcaac cgagggtaca 1260 
ccctgtatgg cttcacccac tgtggagaca ccaatgagtg cagcatcaac aacggaggct 1320 
gtcagcaggt ctgtgtgaac acagtgggca gctatgaatg ccagtgccac cctgggtaca 1380 
agctccactg gaataaaaaa gactgtgtgg aagtgaaggg gctcctgccc acaagtgtgt 1440 
caccccgtgt gtccctgcac tgcggtaaga gtggtggagg agacgggtgc ttcctcagat 1500 
gtcactctgg cattcacctc tcttcagatg tcaccaccat caggacaagt gtaaccttta 1560 
agctaaatga aggcaagtgt agtttgaaaa atgctgagct gtttcccgag ggtctgcgac 1620 
cagcactacc agagaagcac agctcagtaa aagagagctt ccgctacgta aaccttacat 1680 
gcagctctgg caagcaagtc ccaggagccc ctggccgacc aagcacccct aaggaaatgt 1740 
ttatcactgt tgagtttgag cttgaaacta accaaaagga ggtgacagct tcttgtgacc 1800 
tgagctgcat cgtaaagcga accgagaagc ggctccgtaa agccatccgc acgctcagaa 1860 
aggccgtcca cagggagcag tttcacctcc agctctcagg catgaacctc gacgtggcta 1920 
aaaagcctcc cagaacatct gaacgccagg cagagtcctg tggagtgggc cagggtcatg 1980 
cagaaaacca atgtgtcagt tgcagggctg ggacctatta tgatggagca cgagaacgct 2040 
gcattttatg tccaaatgga accttccaaa atgaggaagg acaaatgact tgtgaaccat 2100 
gcccaagacc aggaaattct ggggccctga agaccccaga agcttggaat atgtctgaat 2160 
gtggaggtct gtgtcaacct ggtgaatatt ctgcagatgg ctttgcacct tgccagctct 2220 
gtgccctggg cacgttccag cctgaagctg gtcgaacttc ctgcttcccc tgtggaggag 2280 
gccttgccac caaacatcag ggagctactt cctttcagga ctgtgaaacc agagttcaat 2340 
gttcacctgg acatttctac aacaccacca ctcaccgatg tattcgttgc ccagtgggaa 2400 
cataccagcc tgaatttgga aaaaataatt gtgtttcttg cccaggaaat actacgactg 24 60 
actttgatgg ctccacaaac ataacccagt gtaaaaacag aagatgtgga ggggagctgg 2520 
gagatttcac tgggtacatt gaatccccaa actacccagg caattaccca gccaacaccg 2580 
agtgtacgtg gaccatcaac ccacccccca agcgccgcat cctgatcgtg gtccctgaga 2640 
tcttcctgcc catagaggac gactgtgggg actatctggt gatgcggaaa acctcttcat 2700 
ccaattctgt gacaacatat gaaacctgcc agacctacga acgccccatc gccttcacct 2760 
ccaggtcaaa gaagctgtgg attcagttca agtccaatga agggaacagc gctagagggt 2820 
tccaggtccc atacgtgaca tatgatgagg actaccagga actcattgaa gacatagttc 2880 
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gagatggcag gctctatgca tctgagaacc atcaggaaat acttaaggat aagaaactta 2940 
tcaaggctct gtttgatgtc ctggcccatc cccagaacta tttcaagtac acagcccagg 3000 
agtcccgaga gatgtttcca agatcgttca tccgattgct acgttccaaa gtgtccaggt 3060 
ttttgagacc ttacaaatga ctcagcccac gtgccactca atacaaatgt tctgctatag 3120 
ggttggtggg acagagctgt cttccttctg catgtcagca cagtcgggta ttgctgcctc 3180 
ccgtatcagt gactcattag agttcaattt ttatagataa tacagatatt ttggtaaatt 3240 
gaacttggtt tttctttccc agcatcgtgg atgtagactg agaatggctt tgagtggcat 3300 
cagcttctca ctgctgtggg cggatgtctt ggatagatca cgggctggct gagctggact 3360 
ttggtcagcc taggtgagac tcacctgtcc ttctggggtc ttactcctcc tcaaggagtc 3420 
tgtagtggaa aggaggccac agaataagct gcttattctg aaacttcagc ttcctctagc 3480 
ccggccctct ctaagggagc cctctgcact cgtgtgcagg ctctgaccag gcagaacagg 354 0 
caagagggga gggaaggaga cccctgcagg ctccctccac ccaccttgag acctgggagg 3600 
actcagtttc tccacagcct tctccagcct gtgtgataca agtttgatcc caggaacttg 3660 
agttctaagc agtgctcgtg aaaaaaaaaa gcagaaagaa ttagaaataa ataaaaacta 3720 
agcacttctg gagacat 3737 

<210> 377 

<211> 3304 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000127 
<400> 377 

gcgaccgaac gcggcggtcg gcagcgttcg cgcgggggcc tgcgaagcgc tgctcggggc 60 
cggcactgcc cgcggggagg acgcgccgcc gccgccaccc agcgccgccg ccgccgccgc 120 
ctccagccgg gccgccgcgc gtcccggggg ccggccccgc gagcgcagga gtaaacaccg 180 
ccggagtctt ggagccgctg cagaagggaa taaagagaga tgcagggatt tgtgaggtta 240 
cggcgcccca gctgcaagat gcactagccg gctgaacccg ggatcggctg acttgttgga 300 
accggagtgc tctgcacgga gagtggtgga tgagttgaag ttgccttccc ggggctcatt 360 
ttccacgctg ccgagaggaa tccgagaggc aaggcaatca cttcgtcttg ccattgattg 420 
ggtatcggga gctttttttt tctcccctct ctctttcttt tcctccgtct tgttgcatgc 480 
aagaaaatta cagtccgctg ctcgcccgcc ctgggtgcga gatattcagc cccgctctct 54 0 
cccgtgcatt gtgcaaccca aagatgaaag accgaagggg agaaagttaa agaaatcgcc 600 
cacatgcgct ggatcagtcc acggcttggg gaaaggcatc cagagaaggt gggagcggag 660 
agtttgaagt ctttacaggc gggaagatgg cggactggag ctgaaagtgt tgattgggaa 720 
acttgggtga ttcttgtgtt tatttacaat cctcttgacc caggcaggac acatgcaggc 780 
caaaaaacgc tatttcatcc tgctctcagc tggctcttgt ctcgcccttt tgttttattt 840 
cggaggcttg cagtttaggg catcgaggag ccacagccgg agagaagaac acagcggtag 900 
gaatggcttg caccacccca gtccggatca tttctggccc cgcttcccgg agcctctgcg 960 
ccccttcgtt ccttgggatc aattggaaaa cgaggattcc agcgtgcaca tttccccccg 1020 
gcagaagcga gatgccaact ccagcatcta caaaggcaag aagtgccgca tggagtcctg 1080 
cttcgatttc accctttgca agaaaaacgg cttcaaagtc tacgtatacc cacagcaaaa 1140 
aggggagaaa atcgccgaaa gttaccaaaa cattctagcg gccatcgagg gctccaggtt 1200 
ctacacctcg gaccccagcc aggcgtgcct ctttgtcctg agtctggata ctttagacag 12 60 
agaccagttg tcacctcagt atgtgcacaa tttgagatcc aaagtgcaga gtctccactt 1320 
gtggaacaat ggtaggaatc atttaatttt taatttatat tccggcactt ggcctgacta 1380 
caccgaggac gtggggtttg acatcggcca ggcgatgctg gccaaagcca gcatcagtac 1440 
tgaaaacttc cgacccaact ttgatgtttc tattcccctc ttttctaagg atcatcccag 1500 
gacaggaggg gagagggggt ttttgaagtt caacaccatc cctcctctca ggaagtacat 1560 
gctggtattc aaggggaaga ggtacctgac agggatagga tcagacacca ggaatgcctt 1620 
atatcacgtc cataacgggg aggacgttgt gctcctcacc acctgcaagc atggcaaaga 1680 
ctggcaaaag cacaaggatt ctcgctgtga cagagacaac accgagtatg agaagtatga 1740 
ttatcgggaa atgctgcaca atgccacttt ctgtctggtt cctcgtggtc gcaggcttgg 1800 
gtccttcaga ttcctggagg ctttgcaggc tgcctgcgtc cctgtgatgc tcagcaatgg 1860 
atgggagttg ccattctctg aagtgattaa ttggaaccaa gctgccgtca taggcgatga 1920 
gagattgtta ttacagattc cttctacaat caggtctatt catcaggata aaatcctagc 1980 
acttagacag cagacacaat tcttgtggga ggcttatttt tcttcagttg agaagattgt 2040 
attaactaca ctagagatta ttcaggacag aatattcaag cacatatcac gtaacagttt 2100 
aatatggaac aaacatcctg gaggattgtt cgtactacca cagtattcat cttatctggg 2160 
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agattttcct tactactatg ctaatttagg 
catccatgcg gtgacccccc tggtctctca 
tgcagccaag tcccagtact gtgcccagat 
accagccaaa caccgctggc ctgccactgc 
caaggttatg agcagccgtt ttctgcccta 
ccttgacgag gacacggtgc tttcaacaac 
gagcttccct gagaggattg tggggtaccc 
ggagcggtgg ggatacacat caaagtggac 
tgctatttac cacaaatatt atcactacct 
gaacatggtg gaccaattgg ccaattgtga 
tgtgacaaaa ttgcctccaa tcaaagtgac 
gggacagact tctcgggctt cccgttgggc 
ctgcatgaat acgtttgcca gctggtttgg 
gctcgacccc gtcctcttta aagaccaggt 
tgagcgactt tgaggaatcc ggctgagtgg 
agctgctctc tcttcccagt gcagatccac 
ccaaaaactt agatgagcag aatgacaaaa 
ggctcctggg actgcaccag actgctccaa 
ctaggttggt acagtttaat tatggaacat 
aaaa 



tttaaagccc ccctccaaat tcactgcagt 2220 
gtcccagcca gtgttgaagc ttctcgtggc 2280 
catagttcta tggaattgtg acaagcccct 2340 
tgtgcctgtc gtcgtcattg aaggagagag 2400 
cgacaacatc atcacagacg ccgtgctcag 24 60 
agaggtggat ttcgccttca cagtgtggca 2520 
cgcgcgcagc cacttctggg ataactctaa 2580 
gaacgactac tccatggtgt tgacaggagc 264 0 
atactcccat tacctgccag ccagcctgaa 2700 
ggacattctc atgaacttcc tggtgtctgc 27 60 
ccagaagaag cagtataagg agacaatgat 2820 
tgaccctgac cactttgccc agcgacagag 2880 
ctacatgccg ctgatccact ctcagatgag 2940 
ctctattttg aggaagaaat accgagacat 3000 
gggaggggaa gcaagaaggg atgggggtca 3060 
tcatcagcag agccagattg tgccaactat 3120 
aaaaaaaagg ccaatgagaa ctcaactcct 3180 
actcacctca ctggcttctg tgtcccaaga 3240 
taaataatta tttttgaaaa aaaaaaaaaa 3300 
3304 



<210> 378 
<211> 7680 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_002019 



<400> 378 

gcggacactc ctctcggctc ctccccggca 
tcgggtgcag cggccagcgg gcctggcggc 
ggctggagcc gcgagacggg cgctcagggc 
gactctggcg gccgggtcgt tggccggggg 
gcgctcacca tggtcagcta ctgggacacc 
ctgcttctca caggatctag ttcaggttca 
ggcacccagc acatcatgca agcaggccag 
gcccataaat ggtctttgcc tgaaatggtg 
aaatctgcct gtggaagaaa tggcaaacaa 
caagcaaacc acactggctt ctacagctgc 
aaggaaacag aatctgcaat ctatatattt 
atgtacagtg aaatccccga aattatacac 
tgccgggtta cgtcacctaa catcactgtt 
atccctgatg gaaaacgcat aatctgggac 
acgtacaaag aaatagggct tctgacctgt 
acaaactatc tcacacatcg acaaaccaat 
cgcccagtca aattacttag aggccatact 
ttgaacacga gagttcaaat gacctggagt 
gtaaggcgac gaattgacca aagcaattcc 
attgacaaaa tgcagaacaa agacaaagga 
tcattcaaat ctgttaacac ctcagtgcat 
catcgaaaac agcaggtgct tgaaaccgta 
aaagtgaagg catttccctc gccggaagtt 
gagaaatctg ctcgctattt gactcgtggc 
gaggatgcag ggaattatac aatcttgctg 
ctcactgcca ctctaattgt caatgtgaaa 
tttccagacc cggctctcta cccactgggc 
ggtatccctc aacctacaat caagtggttc 
gcaaggtgtg acttttgttc caataatgaa 
atgggaaaca gaattgagag catcactcag 
atggctagca ccttggttgt ggctgactct 



gcggcggcgg ctcggagcgg gctccggggc 60 
gaggattacc oggggaagtg gttgtctcct 120 
gcggggccgg cggcggcgaa cgagaggacg 180 
agcgcgggca ccgggcgagc aggccgcgtc 24 0 
ggggtcctgc tgtgcgcgct gctcagctgt 300 
aaattaaaag atcctgaact gagtttaaaa 360 
acactgcatc tccaatgcag gggggaagca 420 
agtaaggaaa gcgaaaggct gagcataact 4 80 
ttctgcagta ctttaacctt gaacacagct 540 
aaatatctag ctgtacctac ttcaaagaag 600 
attagtgata caggtagacc tttcgtagag 660 
atgactgaag gaagggagct cgtcattcoc 720 
actttaaaaa agtttccact tgacactttg 780 
agtagaaagg gcttcatcat atcaaatgca 840 
gaagcaacag tcaatgggca tttgtataag 900 
acaatcatag atgtccaaat aagcacacca 960 
cttgtcctca attgtactgc taccactccc 1020 
taccctgatg aaaaaaataa gagagcttcc 1080 
catgccaaca tattctacag tgttcttact 1140 
ctttatactt gtcgtgtaag gagtggacca 1200 
atatatgata aagcattcat cactgtgaaa 1260 
gctggcaagc ggtcttaccg gctctctatg 1320 
gtatggttaa aagatgggtt acctgcgact 1380 
tactcgttaa ttatcaagga cgtaactgaa 1440 
agcataaaac agtcaaatgt gtttaaaaac 1500 
ccccagattt acgaaaaggc cgtgtcatcg 1560 
agcagacaaa tcctgacttg taccgcatat 1620 
tggcacccct gtaaccataa tcattccgaa 1680 
gagtccttta tcctggatgc tgacagcaac 1740 
cgcatggcaa taatagaagg aaagaataag 1800 
agaatttctg gaatctacat ttgcatagct 1860 
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tccaataaag ttgggactgt gggaagaaac 
gggtttcatg ttaacttgga aaaaatgccg 
acagttaaca agttcttata cagagacgtt 
agaacaatgc actacagtat tagcaagcaa 
actcttaatc ttaccatcat gaatgtttcc 
gccaggaatg tatacacagg ggaagaaatc 
caggaagcac catacctcct gcgaaacctc 
accactttag actgtcatgc taatggtgtc 
aaccacaaaa tacaacaaga gcctggaatt 
attgaaagag tcacagaaga ggatgaaggt 
ggctctgtgg aaagttcagc atacctcact 
gagctgatca ctctaacatg cacctgtgtg 
ctccttatcc gaaaaatgaa aaggtcttct 
ataatggacc cagatgaagt tcctttggat 
agcaagtggg agtttgcccg ggagagactt 
tttggaaaag tggttcaagc atcagcattt 
gtggctgtga aaatgctgaa agagggggcc 
gagctaaaaa tcttgaccca cattggccac 
tgcaccaagc aaggagggcc tctgatggtg 
tccaactacc tcaagagcaa acgtgactta 
atggagccta agaaagaaaa aatggagcca 
gatagcgtca ccagcagcga aagctttgcg 
agtgatgttg aggaagagga ggattctgac 
gatctgattt cttacagttt tcaagtggcc 
tgcattcatc gggacctggc agcgagaaac 
atttgtgatt ttggccttgc ccgggatatt 
gatactcgac ttcctctgaa atggatggct 
accaagagcg acgtgtggtc ttacggagta 
tctccatacc caggagtaca aatggatgag 
aggatgagag ctcctgagta ctctactcct 
cacagagacc caaaagaaag gccaagattt 
cttcaagcaa atgtacaaca ggatggtaaa 
ggaaatagtg ggtttacata ctcaactcct 
atttcagctc cgaagtttaa ttcaggaagc 
aagttcatga gcctggaaag aatcaaaacc 
atgtttgatg actaccaggg cgacagcagc 
ttcacctgga ctgacagcaa acccaaggcc 
aaaagtaagg agtcggggct gtctgatgtc 
gggcacgtca gcgaaggcaa gcgcaggttc 
atcgcgtgct gctccccgcc cccagactac 
atctagagtt tgacacgaag ccttatttct 
aactagcttt tgccagtatt atgcatatat 
cagctgcttt ttgtgatttt tttaatagtg 
tccagataga gaaatagtga caagtgaaga 
tgttagagaa atccttccta aacccaatga 
gcacgcagga ccagtttgat tgaggagctg 
ctgggccagc cctgcagccc aaaacccagg 
tggctggcct gagcaacatc tcgggagtcc 
aaggaaaaaa agcaaaaagc aagggagaaa 
tttgagacgc accatgtggg cacggagggg 
ttcccagctc tgacccttct acatttgagg 
ggggacattt tctggattct gggaggcaag 
gcaaatttta gacctttacc tatggaagtg 
tttgatttgt agcactgagg gtggcactca 
taagatgcac tgaaaactta gccagagtta 
tgaaaatgtc acattctatt ttgggtatta 
ttggtattat tctgttttgc acagttagtt 
agtcctgagg agagttttct ccatatcaaa 
aggtcaaggg aagaccccgt ctctatacca 
ccaaaacaca ggaagtcagt cacgtttcct 
actaatctga aaggatgtgg aagagcatta 



ataagctttt atatcacaga tgtgccaaat 1920 
acggaaggag aggacctgaa actgtcttgc 1980 
acttggattt tactgcggac agttaataac 2040 
aaaatggcca tcactaagga gcactccatc 2100 
ctgcaagatt caggcaccta tgcctgcaga 2160 
ctccagaaga aagaaattac aatcagagat 2220 
agtgatcaca cagtggccat cagcagttcc 2280 
cccgagcctc agatcacttg gtttaaaaac 2340 
attttaggac caggaagcag cacgctgttt 2400 
gtctatcact gcaaagccac caaccagaag 24 60 
gttcaaggaa cctcggacaa gtctaatctg 2520 
gctgcgactc tcttctggct cctattaacc 2580 
tctgaaataa agactgacta cctatcaatt 2640 
gagcagtgtg agcggctccc ttatgatgcc 2700 
aaactgggca aatcacttgg aagaggggct 27 60 
ggcattaaga aatcacctac gtgccggact 2820 
acggccagcg agtacaaagc tctgatgact 2880 
catctgaacg tggttaacct gctgggagcc 2940 
attgttgaat actgcaaata tggaaatctc 3000 
ttttttctca acaaggatgc agcactacac 3060 
ggcctggaac aaggcaagaa accaagacta 3120 
agctccggct ttcaggaaga taaaagtctg 3180 
ggtttctaca aggagcccat cactatggaa 3240 
agaggcatgg agttcctgtc ttccagaaag 3300 
attcttttat ctgagaacaa cgtggtgaag 3360 
tataagaacc ccgattatgt gagaaaagga 3420 
cccgaatcta tctttgacaa aatctacagc 3480 
ttgctgtggg aaatcttctc cttaggtggg 3540 
gacttttgca gtcgcctgag ggaaggcatg 3600 
gaaatctato agatcatgct ggactgctgg 3660 
gcagaacttg tggaaaaact aggtgatttg 3720 
gactacatcc caatcaatgc catactgaca 3780 
gccttctctg aggacttctt caaggaaagt 3840 
tctgatgatg tcagatatgt aaatgctttc 3900 
tttgaagaac ttttaccgaa tgccacctcc 3960 
actctgttgg cctctcccat gctgaagcgc 4 020 
tcgctcaaga ttgacttgag agtaaccagt 4 080 
agcaggccca gtttctgcca ttccagctgt 4140 
acctacgacc acgctgagct ggaaaggaaa 4 200 
aactcggtgg tcctgtactc caccccaccc 4 260 
agaagcacat gtgtatttat acccccagga 4320 
aagtttacac ctttatcttt ccatgggagc 4 380 
cttttttttt ttgactaaca agaatgtaac 4 44 0 
acactactgc taaatcctca tgttactcag 4 500 
cttccctgct ccaacccccg ccacctcagg 4560 
cactgatcac ccaatgcatc acgtacccca 4 620 
gcaacaagcc cgttagcccc aggggatcac 4 680 
tctagcaggc ctaagacatg tgaggaggaa 47 40 
agagaaaccg ggagaaggca tgagaaagaa 4800 
gacggggctc agcaatgcca tttcagtggc 4860 
gcccagccag gagcagatgg acagcgatga 4 920 
aaaaggacaa atatcttttt tggaactaaa 4 980 
gttctatgtc cattctcatt cgtggcatgt 5040 
actctgagcc catacttttg gctcctctag 5100 
ggttgtctcc aggccatgat ggccttacac 5160 
atatatagtc cagacactta actcaatttc. 5220 
gtgaaagaaa gctgagaaga atgaaaatgc 5280 
acgagggctg atggaggaaa aaggtcaata 5340 
accaaaccaa ttcaccaaca cagttgggac 54 00 
tttcatttaa tggggattcc actatctcac 54 60 
gctggcgcat attaagcact ttaagctcct 5520 
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tgagtaaaaa ggtggtatgt aatttatgca aggtatttct ccagttggga ctcaggatat 5580 
tagttaatga gccatcacta gaagaaaagc ccattttcaa ctgctttgaa acttgcctgg 564 0 
ggtctgagca tgatgggaat agggagacag ggtaggaaag ggcgcctact cttcagggtc 5700 
taaagatcaa gtgggccttg gatcgctaag ctggctctgt ttgatgctat ttatgcaagt 57 60 
tagggtctat gtatttagga tgcgcctact cttcagggtc taaagatcaa gtgggccttg 5820 
gatcgctaag ctggctctgt ttgatgctat ttatgcaagt tagggtctat gtatttagga 5880 
tgtctgcacc ttctgcagcc agtcagaagc tggagaggca acagtggatt gctgcttctt 5940 
ggggagaaga gtatgcttcc ttttatccat gtaatttaac tgtagaacct gagctctaag 6000 
taaccgaaga atgtatgcct ctgttcttat gtgccacatc cttgtttaaa ggctctctgt 6060 
atgaagagat gggaccgtca tcagcacatt ccctagtgag cctactggct cctggcagcg 6120 
gcttttgtgg aagactcact agccagaaga gaggagtggg acagtcctct ccaccaagat 6180 
ctaaatccaa acaaaagcag gctagagcca gaagagagga caaatctttg ttgttcctct 6240 
tctttacaca tacgcaaacc acctgtgaca gctggcaatt ttataaatca ggtaactgga 6300 
aggaggttaa actcagaaaa aagaagacct cagtcaattc tctacttttt tttttttttt 6360 
tccaaatcag ataatagccc agcaaatagt gataacaaat aaaaccttag ctgttcatgt 64 20 
cttgatttca ataattaatt cttaatcatt aagagaccat aataaatact ccttttcaag 6480 
agaaaagcaa aaccattaga attgttactc agctccttca aactcaggtt tgtagcatac 6540 
atgagtccat ccatcagtca aagaatggtt ccatctggag tcttaatgta gaaagaaaaa 6600 
tggagacttg taataatgag ctagttacaa agtgcttgtt cattaaaata gcactgaaaa 6660 
ttgaaacatg aattaactga taatattcca atcatttgcc atttatgaca aaaatggttg 6720 
gcactaacaa agaacgagca cttcctttca gagtttctga gataatgtac gtggaacagt 6780 
ctgggtggaa tggggctgaa accatgtgca agtctgtgtc ttgtcagtcc aagaagtgac 6840 
accgagatgt taattttagg gacccgtgcc ttgtttccta gcccacaaga atgcaaacat 6900 
caaacagata ctcgctagcc tcatttaaat tgattaaagg aggagtgcat ctttggccga 6960 
cagtggtgta actgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgggtgtg 7020 
ggtgtatgtg tgttttgtgc ataactattt aaggaaactg gaattttaaa gttactttta 7080 
tacaaaccaa gaatatatgc tacagatata agacagacat ggtttggtcc tatatttcta 7140 
gtcatgatga atgtattttg tataccatct tcatataata tacttaaaaa tatttcttaa 7200 
ttgggatttg taatcgtacc aacttaattg ataaacttgg caactgcttt tatgttctgt 7260 
ctccttccat aaatttttca aaatactaat tcaacaaaga aaaagctctt ttttttccta 7320 
aaataaactc aaatttatcc ttgtttagag cagagaaaaa ttaagaaaaa ctttgaaatg 7380 
gtctcaaaaa attgctaaat attttcaatg gaaaactaaa tgttagttta gctgattgta 74 40 
tggggttttc gaacctttca ctttttgttt gttttaccta tttcacaact gtgtaaattg 7500 
ccaataattc ctgtccatga aaatgcaaat tatccagtgt agatatattt gaccatcacc 7560 
ctatggatat tggctagttt tgcctttatt aagcaaattc atttcagcct gaatgtctgc 7620 
ctatatattc tctgctcttt gtattctcct ttgaacccgt taaaacatcc tgtggcactc 7680 



<210> 379 
<211> 2679 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_002073 
<400> 379 

gagaccagga ccatgggatg tcggcaaagc 
cggagaattg accgccacct gcgctcagag 
ctcctgctgg gcaccagcaa ctcaggcaag 
cacagcggcg gcttcaacct ggaggcctgc 
gccatcgact cgctgacccg catcatccgg 
aaccccgacc gcgcctacga cgctgtgcag 
aagggcgaga tcacacccga gctgctgggt 
gcacaggcct gcttcagccg ctccagcgag 
ctgaacgacc tggagcgcat cgccgcagct 
cgctcccggg acatgaccac gggcattgtg 
ttcaagatgg tggacgtggg ggggcagagg 
gagggcgtca cagccatcat cttctgtgtg 



tcagaggaaa aagaagcagc ccggcggtcc 60 
agccagcggc aacgccgcga aatcaagctg 120 
agcaccatcg tcaaacagat gaagatcatc 180 
aaggagtaca agcccctcat catctacaat 240 
gccctggccg ccctcaggat cgacttccac 300 
ctctttgcgc tgacgggccc cgctgagagc 360 
gtcatgcgac ggctctgggc cgacccaggg 420 
taccacctgg aggacaacgc ggcctactac 4 80 
gactatatcc ccactgtcga ggacatcctg 540 
gagaacaagt tcaccttcaa ggagctcacc 600 
tcagagcgca aaaagtggat ccactgcttc 660 
gagctcagcg gctacgacct gaaactctac 720 
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gaggataacc agacaagtcg gatggcagag agcttgcgcc tctttgactc catctgcaac 780 

aacaactggt tcatcaacac ctcactcatc ctcttcctga acaagaagga cctgctggca 84 0 

gagaagatcc gccgcatccc gctcaccatc tgctttcccg agtacaaggg ccagaacacg 900 

tacgaggagg ccgctgtcta catccagcgg cagtttgaag aoctgaaccg caacaaggag 960 

accaaggaga tctactccca cttcacctgc gccaccgaca ccagtaacat ccagtttgtc 1020 

ttcgacgcgg tgacagacgt catcatacag aacaatctca agtacattgg cctttgctga 1080 

ggagctgggc ccggggcgcc tgcctatggt gaaacccacg gggtgtcatg ccccaacgcg 114 0 

tgctagagag gcccaatcca ggggcagaaa acagggggcc taaagaatgt cccccacccc 1200 

ttggcctctg cctccttggc cccacatttc tgcaaacata aatatttacg gatagattgc 12 60 

taggtagata gacacacaca catgcacaca cacacatctg gagatggcaa aatcctctaa 1320 

aatgtcgagg tctcttgaag acttgagaag ctgtcacaag gtcactacaa gcccaacctg 1380 

ccccttcact ttgccttcct gagttggccc cactccactt gggggtctgc attggattgt 1440 

tagggatagg cagcagggct gaggcaaggt aggccaactg cacccctgtc acctggagga 1500 

gggccggctc gctgcccgag ctctggccta gggaccttgc cgctgaccaa gagggaggac 1560 

cagtgcaggg tctgtgcacc ttccctgctg gcctgcacac agctgctcag caccatttca 1620 

ttctggacct gggaccttag gagccgggtg acagcactaa ccagacctcc agccactcac 1680 

agctcttttt aaaaaacagc ttcaaaatat gcagcaaaaa ccaatacaac aaaacgagtg 1740 

gcacgattta tttcaaacta ggccagctgg gattccagct tttcttctac tagtctgatg 1800 

ttttataaat caaaacctgg ttttccttct ctggcatttt tttttgtttt ttgttttttg 1860 

gttttttttt tttttttggc caaatctcgt ggtgtttcgc agaaaaaaat ccagaaaatt 1920 

tcaaatgcag ttgagtattc ttttttaaat gcagattttc aaaacatatt ttttttcagg 1980 

tggtcttttt tgtgtctggc ttgctgagtg taaaagttgt tatctggacg atctgtctct 2040 

ctgctccaaa gaaattttgg agtgagtggc agtcctgcgc cagcctcgcg ggacacgtgt 2100 

tgtacataag cctctgcagt gtcctcttgt taatggtggg gttttctgct ttgtttttat 2160 

ttaagaaaat aaacacgaca tatttaaaga aggttctttc acctgggagc aaatgaacaa 2220 

tagctaagtg tcttggtatt taaagagtaa attatttgtg gctttgctga gtgaaggaag 2280 

gggagcaagg ggtggtgccc ctggtcccag catgccccgc gcctgagact ggctggaaat 2340 

gctctgactc ctgtgaaggc acagccagcg ttgtggcctg agggaggccc tgctgggacc 2400 

ctgatctggg ccttcctgtc ccagggccta tgggcaactg cgttgaaagg acgttcgcca 2460 

agggccgtgt gtaaatacga actgcgccat ggagaggaga ggcactgccg gagcccttgc 2520 

cagatctccc tccctctctc tgtgcagtag ctgtgtgtcc gaggtcagtg tgcggaatca 2580 

cagccaagga cgtgaagaga tgtacggggg aaagagaagc tggggattgg atgaaagtca 2640 

aaggttgtct actttaagaa aataaaatac cctgaatgg 2679 

<210> 380 
<211> 3337 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_000436 
<400> 380 

gtcgagcctc tagcccgccc gggtttcctt cgcagtcgcg caccgacgct 

ctccaacccg cagcctcctc ctgcctcacc gcccgaagat ggcggctctc 

cctccgggct tcggctctgc gcctctgccc gcggatctgg ggcaacctgg 

gtgtttgttc cttttccacc agtgctcatc gccataccaa gttttataca 

aagctgtaaa agacatccct gatggtgcca cggttttggt tggtggtttt 

gaattccaga gaatcttata gatgctttac tgaaaactgg agtaaaagga 

tcagcaacaa tgcaggggtt gacaattttg gtttggggct tttgcttcgg 

taaaacgcat ggtctcttca tatgtgggag aaaatgcaga atttgaacga 

ctggtgaatt agaagtggag ctgacaccac agggcacact tgcagagagg 

gcggggctgg agttcctgca ttttacaccc caacagggta tgggaccctg 

gaggatcgcc catcaaatac aacaaagatg gcagtgttgc cattgccagt 

aggtgaggga gttcaatggt cagcacttta ttttggagga agcaattaca 

ctttggtgaa agcctggaag gcggaccgag caggaaacgt gattttcagg 

ggaatttcaa cttgccaatg tgcaaagctg cagaaaccac agtggtagag 

ttgtggatat tggagcattt gctccagaag acatccatat tcctcagatt 

gccttataaa gggagaaaaa tatgagaaaa gaattgagcg tttatcaatc 

gagatgggga agccaaatct gctaaacctg gagatgacgt aagggaacga 

gggccgctct tgagtttgag gatggcatgt atgctaattt gggcatagga 



caaacgcgcg 60 
aaactcctct 120 
tacaagggat 180 
gatccagtag 240 
gggctatgtg 300 
ctaactgcag 360 
tcgaagcaga 420 
cagtacttat 480 
atccgtgcag 54 0 
gtacaagaag 600 
aagccaagag 660 
ggggattttg 720 
aaaagtgcaa 780 
gttgaagaaa 840 
tatgtacatc 900 
cggaaagagg 960 
atcatcaaga 1020 
atccctctcc 1080 
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tggccagcaa ttttatcagc ccaaatataa ctgttcatct tcaaagtgaa aatggagttc 1140 
tgggtttggg tccatatcca cgacaacatg aagctgatgc agatctcatc aatgcaggca 1200 
aggaaacagt tactattctt ccaggagcct cttttttctc cagcgatgaa tcatttgcaa 1260 
tgattagagg tggacacgtc gatctgacaa tgctaggagc gatgcaggtt tccaaatatg 1320 
gtgacctggc taactggatg atacctggga agatggtgaa aggaatggga ggtgctatgg 1380 
atttagtgtc cagtgcgaaa accaaagtgg tggtcaccat ggagcattct gcaaagggaa 1440 
atgcacataa aatcatggag aaatgtacat taccattgac tggaaagcaa tgtgtcaacc 1500 
gcattattac tgaaaaggct gtgtttgatg tggacaagaa gaaagggttg actctgattg 1560 
agctctggga aggcctgaca gtggatgacg tacaaaagag tactgggtgt gattttgcag 1620 
tttcaccaaa actcatgcca atgcagcaga tcgcaaattg aaatatggat atttgtacca 1680 
ggctgcgtgt ttttcatttt aaacacacaa gatttaattg aaaggacatc aataatcata 1740 
attgtgtatt taacaggtgg ttttttatta gttttcttgt gtttcagact ttatgcagcc 1800 
atataaactg ttctctaggc atgctgtgac attttaataa aaagcaaaag gagcatttat 1860 
aattatctca tttgttaagg ctgagaaggt tgtttttata ataggtaatt atattgaatg 1920 
cattttcact gaatatggta tgtatgctaa attatatgaa cctttcccca agaagggccc 1980 
tagaaattga tgtggctttc ctcttaaata ttaattatta gtcctgaaag aaagataaca 2040 
tatgtgattt ttgtggttag gagagttgct gtcatgattg ttttttcttc agcctcctct 2100 
gacttttctt ttggggcttc agattttatg attacatctt gtccccctag aacatccccc 2160 
ttcctcccat actgctttta aacagatgcc caagaaggca agcaggaatg cctcttgtgg 2220 
gggagggcag ggagaaataa ctagttcaaa ccaactatct atctatgctt tgcaaagact 2280 
aaggcgtatt ataggaagag ggctagaaac ctaactgatt cttctcagtt ttctcatttt 2340 
aaaacagccc agtattcctt tgtatcctca agggtccttg agaatacttc tgttattgaa 2400 
accctgtggg ctacttgtac tgtacctcct ctcaagccaa gaagggctgt gggataattt 2460 
accatgaatc cttagtagca atgacagcag agttaaaaaa taaaaggtgt tttactttca 2520 
ggctcttgtt ttggttcaga ggagatttta aatattgaat gacacttcta cagaacaacg 2580 
gtttttcttc tgccaaggct acttccttta acgaagtgcc tttaattcag ccttatccaa 2640 
ctagggaaaa taatgttgga caagtctagg atttgaagag tcagtgaact tttagtgtca 2700 
gggaataaac atggtgggta gattaggttt gaaaaaaact tccttagagg tatttattct 2760 
caatacctga caggggccca tgggaatgac ttcagaagca tcccggataa tagatgggta 2820 
aaaagtctag gcaccctgaa gaacaggtga gacagctggc ctctggacag aggtaggcat 2880 
aqtacagtac gatatatcat tcctctggtc ctaaatatac aaacttattc atgtttttag 2940 
gtgatgatgg tcattgaaac tcacttcttt tcaggtgtag ctacaattgt gtaatgtaca 3000 
atattagaga aaggacaggc tttttatgag taacacacac catatataaa acagcctttc 3060 
tggctgacca catggttaaa tgcatacctt cccagtactg gggggaaaat gacccttctt 3120 
agaatgtgca agttccatag agtaatatat tgatatgatt ttgaaaagaa ttgttgatag 3180 
ttacatcttc aaacttatca ttccagtatg catctttaag ataatgtgat tctaagtaga 3240 
tgactttata ttcttgatta aagagtgcta tacatgttaa gaaatgcatt aaggaataca 3300 
ataaatattc taaactgatg aaaaaaaaaa aaaaaaa 3337 

<210> 381 
<211> 1020 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig55377_RC 
<400> 381 

tagttttcaa aaaaatattg 
accaaagctg caggtctcta 
agcatccaca tatgtatcat 
gtgaactaat tataatgaaa 
aaaatttaag ttaaaatgca 
aagacatgaa aacaatgctg 
gccagtccta gaacagtact 
aacactgaag accaaataga 
gttctggaaa cagcgatctg 
gtacctataa ccagatgtcc 
cacaacctga ggaacagctg 
tagccaggat tttctcgttt 
ttgggaggtg agcagaggaa 



atctggatca gatgggctga 
aggtcttcag agaagctaca 
tttccacaac attttacaat 
tatcagtaat gtatcttaaa 
ctacataata gtgcagaaac 
aatcagtagt gtttggagtg 
caccacaagc tgcatgccaa 
accactctag agtctgtttc 
ctgtagttcc tatcctcctg 
tccatcatcc agcctccagg 
ctggttaccg atttgtattc 
gtcttgaata tatttacttc 
tgtttacttg ctggagaaca 



taaggcccaa acacttataa 60 
caaaatacta tgatttcatt 120 
tctcttgttt cacagggaag 180 
taagtaacat gttgctggaa 240 
aaagtgcagg tggcaatttt 300 
atacattttg cccatttttg 360 
atatgaacta atgaggctaa 420 
acaatgagat actcctcccc 480 
tggcacctat ccctgctaca 540 
acaagtggtg ctcagtgagt 600 
ctgtatatac tcgacttggg 660 
taaaaagact gttagtcttg 720 
tagtaacaaa catcacttct 780 
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cacccctcat aaacagaaag cccattgtct gtgccgtgaa aagaggacta ccttgtcagg 840 
aagattttac ttacctctac aagaaaaagg actcgtatta aaatcgacct gtggttacca 900 
cccttagaaa tgctgttttc agatattcct attcctagtc ttagaggcag ccatggaata 960 
tatagtctcc tctgaccttg gggtcaggaa gaaagggaaa tctattcata aaaggaattt 1020 



<210> 382 
<211> 668 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig35251_RC 
<400> 382 

ctatacagta tttgtcagct gtttttattc ttcagtttat catcttatat ttggctctca 60 
atttcaccca taaattattg atggcttatt atacgccaag taccctgcgc tccgcttttt 120 
gtggtatccc tcttcggagc gagaagtcac ttagaatttt accccatctt ccccacttgg 180 
gaatcggaag ggttcatagc taagttttgt tatttataag cacgccggct cttctctatc 24 0 
cctctctcca ccaaccccat ttcaaggttg gtccctccca gcggacgaca gtgatgaatt 300 
cctcttgggg acagccctcc cgcaactaac ggtggctcca tcgacgctct ggtctccttg 360 
ctgctcgctg ggaattcggg tgggacgttt tccaaagttt ggattccacg gcacattcct 420 
ccctcggaaa ggcacaaccg gctcggcgtg tcagctcagc tggactcacg ccggtgcggg 480 
gccgcagctc cggttctcgc tgggcggctg cggagggctg ggcgattctc tcagcgtcgc 540 
gaggcgaccc gggcagcagc aaacttctcc gcttcccctc ttcgggacgc ggcacctctt 600 
ctgcccttga gcactccgca ggaacttccc ccggcctcgg tctcgctccg aaataaccga 660 
gccagctc 658 

<210> 383 
<211> 430 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig25991 
<400> 383 

agaggagtcg gcgtttgaag aggtggaact cctagggctt ttttgagagt gctgatttag 60 
aagaatacaa atcatggctg aaaatagtgt attaacatcc actactggga ggactagctt 120 
ggcagactct tccatttttg attctaaagt tactgagatt tccaaggaaa acttacttat 180 
tggatctact tcatatgtag aagaagagat gcctcagatt gaaacaagag tgatattggt 240 
tcaagaagct ggaaaacaag aagaacttat aaaagcctta aaggacatta aagtgggctt 300 
tgtaaagatg gagtcagtgg aagaatttga aggtttggat tctccggaat ttgaaaatgt 360 
atttgtagtc acggactttc aggattctgt ctttaatgac ctctacaagg ctgattgtag 420 



<210> 384 
<211> 2212 
<212> DNA 

<213> Homo Sapiens 
<300> 

<308> NM_003875 
<400> 384 

tgccggctgc tcctcgacca ggcctccttc tcaacctcag cccgcggcgc cgacccttcc 60 
ggcaccctcc cgccccgtct cgtactgtcg ccgtcaccgc cgcggctccg gccctggccc 120 
cgatggctct gtgcaacgga gactccaagc tggagaatgc tggaggagac cttaaggatg 180 
gccaccacca ctatgaagga gctgttgtca ttctggatgc tggtgctcag tacgggaaag 240 
tcatagaccg aagagtgagg gaactgttcg tgcagtctga aattttcccc ttggaaacac 300 
cagcatttgc tataaaggaa caaggattcc gtgctattat catctctgga ggacctaatt 360 



agttattgga 



430 
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ctgtgtatgc tgaagatgct ccctggtttg atccagcaat attcactatt ggcaagcctg 420 

ttcttggaat ttgctatggt atgcagatga tgaataaggt atttggaggt actgtgcaca 480 

aaaaaagtgt cagagaagat ggagttttca acattagtgt ggataataca tgttcattat 540 

tcaggggcct tcagaaggaa gaagttgttt tgcttacaca tggagatagt gtagacaaag 600 

tagctgatgg attcaaggtt gtggcacgtt ctggaaacat agtagcaggc atagcaaatg 660 

aatctaaaaa gttatatgga gcacagttcc accctgaagt tggccttaca gaaaatggaa 720 

aagtaatact gaagaatttc ctttatgata tagctggatg cagtggaacc ttcaccgtgc 780 

agaacagaga acttgagtgt attcgagaga tcaaagagag agtaggcacg tcaaaagttt 840 

tggttttact cagtggtgga gtagactcaa cagtttgtac agctttgcta aatcgtgctt 900 

tgaaccaaga acaagtcatt gctgtgcaca ttgataatgg ctttatgaga aaacgagaaa 960 

gccagtctgt tgaagaggcc ctcaaaaagc ttggaattca ggtcaaagtg ataaatgctg 1020 

ctcattcttt ctacaatgga acaacaaccc taccaatatc agatgaagat agaaccccac 1080 

ggaaaagaat tagcaaaacg ttaaatatga ccacaagtcc tgaagagaaa agaaaaatca 1140 

ttggggatac ttttgttaag attgccaatg aagtaattgg agaaatgaac ttgaaaccag 1200 

aggaggtttt ccttgcccaa ggtactttac ggcctgatct aattgaaagt gcatcccttg 12 60 

ttgcaagtgg caaagctgaa ctcatcaaaa cccatcacaa tgacacagag ctcatcagaa 1320 

agttgagaga ggagggaaaa gtaatagaac ctctgaaaga ttttcataaa gatgaagtga 1380 

gaattttggg cagagaactt ggacttccag aagagttagt ttccaggcat ccatttccag 14 4 0 

gtcctggcct ggcaatcaga gtaatatgtg ctgaagaacc ttatatttgt aaggactttc 1500 

ctgaaaccaa caatattttg aaaatagtag ctgatttttc tgcaagtgtt aaaaagccac 1560 

ataccctatt acagagagtc aaagcctgca caacagaaga ggatcaggag aagctgatgc 1620 

aaattaccag tctgcattca ctgaatgcct tcttgctgcc aattaaaact gtaggtgtgc 1680 

agggtgactg tcgttcctac agttacgtgt gtggaatctc cagtaaagat gaacctgact 1740 

gggaatcact tatttttctg gctaggctta tacctcgcat gtgtcacaac gttaacagag 1800 

ttgtttatat atttggccca ccagttaaag aacctcctac agatgttact cccactttct 1860 

tgacaacagg ggtgctcagt actttacgcc aagctgattt tgaggcccat aacattctca 1920 

gggagtctgg gtatgctggg aaaatcagcc agatgccggt gattttgaca ccattacatt 1980 

ttgatcggga cccacttcaa aagcagcctt catgccagag atctgtggtt attcgaacct 2040 

ttattactag tgacttcatg actggtatac ctgcaacacc tggcaatgag atccctgtag 2100 

aggtggtatt aaagatggtc actgagatta agaagattcc tggtatttct cgaattatgt 2160 

atgacttaac atcaaagccc ccaggaacta ctgagtggga gtaataaact tc 2212 

<210> 385 

<211> 2150 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_006101 



ctcgagccac gaaggccccg ctgtcctgtc tagcagatac ttgcacggtt tacagaaatt 60 

cggtccctgg gtcgtgtcag gaaactggaa aaaaggtcat aagcatgaag cgcagttcag 120 

tttccagcgg tggtgctggc cgcctctcca tgcaggagtt aagatcccag gatgtaaata 180 

aacaaggcct ctatacccct caaaccaaag agaaaccaac ctttggaaag ttgagtataa 24 0 

acaaaccgac atctgaaaga aaagtctcgc tatttggcaa aagaactagt ggacatggat 300 

cccggaatag tcaacttggt atattttcca gttctgagaa aatcaaggac ccgagaccac 360 

ttaatgacaa agcattcatt cagcagtgta ttcgacaact ctgtgagttt cttacagaaa 4 20 

atggttatgc acataatgtg tccatgaaat ctctacaagc tccctctgtt aaagacttcc 4 80 

tgaagatctt cacatttctt tatggcttcc tgtgcccctc atacgaactt cctgacacaa 540 

agtttgaaga agaggttcca agaatcttta aagaccttgg gtatcctttt gcactatcca 600 

aaagctccat gtacacagtg ggggctcctc atacatggcc tcacattgtg gcagccttag 660 

tttggctaat agactgcatc aagatacata ctgccatgaa agaaagctca cctttatttg 720 

atgatgggca gccttgggga gaagaaactg aagatggaat tatgcataat aagttgtttt 780 

tggactacac cataaaatgc tatgagagtt ttatgagtgg tgccgacagc tttgatgaga 840 

tgaatgcaga gctgcagtca aaactgaagg atttatttaa tgtggatgct tttaagctgg 900 

aatcattaga agcaaaaaac agagcattga atgaacagat tgcaagattg gaacaagaaa 960 

gagaaaaaga accgaatcgt ctagagtcgt tgagaaaact gaaggcttcc ttacaaggag 1020 

atgttcaaaa gtatcaggca tacatgagca atttggagtc tcattcagcc attcttgacc 1080 

agaaattaaa tggtctcaat gaggaaattg ctagagtaga actagaatgt gaaacaataa 1140 



<400> 385 
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aacaggagaa cactcgacta cagaatatca ttgacaacca gaagtactca gttgcagaca 1200 
tScgaat aaatcatgaa agaaatgaat tgcagcagac tattaataaa ttaaccaagg 1260 
acctggaagc tgaacaacag aagttgtgga atgaggagtt aaaatatgcc agaggcaaag 1320 
aagcgattga aacacaatta gcagagtatc acaaattggc tagaaaatta aaacttattc 1380 
ctaaaggtgc tgagaattcc aaaggttatg actttgaaat taagtttaat cccgaggctg 1440 
gtgccaactg ccttgtcaaa tacagggctc aagtttatgt acctcttaag gaactcctga 1500 
atgaaactga agaagaaatt aataaagccc taaataaaaa aatgggtttg ^aggatactt 1560 
tagaacaatt gaatgcaatg ataacagaaa gcaagagaag tgtgagaact ctgaaagaag 1620 
aagttcaaaa gctggatgat ctttaccaac aaaaaattaa ggaagcagag ^agaggatg 1680 
aaaaatgtgc cagtgagctt gagtccttgg agaaacacaa gcacctgcta gaaagtactg 1740 
ttaaccaggg gctcagtgaa gctatgaatg aattagatgc tgttcagcgg gaataccaac 1800 
tagttgtgca aaccacgact gaagaaagac gaaaagtggg aaataacttg ^acgtctgt 1860 
tagagatggt tgctacacat gttgggtctg tagagaaaca tcttgaggag ^gattgcta 1920 
aagttgatag agaatatgaa gaatgcatgt cagaagatct ctcggaaaat attaaagaga 1980 
ttagagataa gtatgagaag aaagctactc taattaagtc ttctgaagaa tgaagataaa 2040 
atgttgatca tgtatatata tccatagtga ataaaattgt ctcagtaaaa aaaaaaaaaa 2100 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 

<210> 386 
<211> 2830 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_003882 

cicacgcgtc cgctgggccc agctcccccg agaggtggtc ggatcctctg ggctgctcgg 60 
tcgatgcctg tgccactgac gtccaggcat gaggtggttc ctgccctgga cgctggcagc 120 
agtgacagca gcagccgcca gcaccgtcct ggccacggcc ctctctccag ^ c tacgac 180 
catggacttt actccagctc cactggagga cacctcctca cgcccccaat tctgcaagtg 240 
gccatgtgag tgcccgccat ccccaccccg ctgcccgctg ggggtcagcc tcatcacaga 300 
tggctgtgag tgctgtaaga tgtgcgctca gcagcttggg gacaactgca cggaggctgc 360 
catctgtgac ccccaccggg gcctctactg tgactacagc ggggaccgcc cgaggtacgc 420 
aataggagtg tgtgcacagg tggtcggtgt gggctgcgtc ctggatgggg tgcgctacaa 480 
caacggccag tccttccagc ctaactgcaa gtacaactgc acgtgcatcg acggcgcggt 540 
gggctgcaca ccactgtgcc tccgagtgcg ccccccgcgt ctctggtgcc cccacccgcg 600 
gcgcgtgagc atacctggcc actgctgtga gcagtgggta tgtgaggacg acgccaagag 660 
gccacgcaag accgcacccc gtgacacagg agccttcgat gctgtgggtg aggtggaggc 720 
atggcacagg aactgcatag cctacacaag cccctggagc ccttgctcca ccagctgcgg 780 
cctgggggtc tccactcgga tctccaatgt taacgcccag tgctggcctg agcaagagag 840 
ccgcSSgc aacttgcggc catgcgatgt ggacatccat acactcatta aggcagggaa 900 
gaagtgtctg gctgtgtacc agccagaggc atccatgaac ttcacacttg cgggctgcat 960 
cagcacacgc tcctatcaac ccaagtactg tggagtttgc atggacaata ^tgctgcat 1020 
cccctacaag tctaagacta tcgacgtgtc cttccagtgt cctgatgggc ttggcttctc 1080 
ccgccaggtc ctatggatta atgcctgctt ctgtaacctg agctgtagga atcccaatga 1140 
catctttgct gacttggaat cctaccctga cttctcagaa attgccaact aggcaggcac 1200 
aaatcttggg tcttggggac taacccaatg cctgtgaagc agtcagccct tatggccaat 1260 
aacttttcac caatgagcct tagttaccct gatctggacc cttggcctcc atttctgtct 1320 
ctaaccattc aaatgacgcc tgatggtgct gctcaggccc atgctatgag ttttctcctt 1380 
qatatcattc agcatctact ctaaagaaaa atgcctgtct ctagctgttc tggactacac 1440 
?caagcctga tccagccttt ccaagtcact agaagtcctg ctggatcttg cctaaatccc 1500 
aagaaatgga atcaggtaga cttttaatat cactaatttc ttctttagat gccaaaccac 1560 
aagactc£tt gggtccattc agatgaatag atggaatttg gaacaataga ataatctatt 1620 
atttggagcc tgccaagagg tactgtaatg ggtaattctg acgtcagcgc accaaaacta 1680 
fcctgatLc aaatatgtat gcacctcaag gtcatcaaac atttgccaag tgagttgaat 740 
agttqcttaa ttttgatttt taatggaaag ttgtatccat taacctgggc attgttgagg 1800 
ttaagtttct cttcacccct acactgtgaa gggtacagat taggtttgtc ccagtcagaa 1860 
ataaaatttg ataaacattc ctgttgatgg gaaaagcccc cagttaatac tccagagaca 1920 
gggaaaggtl agcccatttc agaaggacca attgactctc acactgaatc agctgctgac 1980 
?ggcagggct ttgggcagtt ggccaggctc ttccttgaat cttctccctt gtcctgcttg 2040 
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ggttcatagg aattggtaag gcctctggac tggcctgtct ggcccctgag agtggtgccc 2100 
tggaacactc ctctactctt acagagcctt gagagaccca gctgcagacc atgccagacc 2160 
cactgaaatg accaagacag gttcaggtag gggtgtgggt caaaccaaga agtgggtgcc 2220 
cttggtagca gcctggggtg acctctagag ctggaggctg tgggactcca ggggcccccg 2280 
tgttcaggac acatctattg cagagactca tttcacagcc tttcgttctg ctgaccaaat 234 0 
ggccagtttt ctggtaggaa gatggaggtt taccagttgt ttagaaacag aaatagactt 2400 
aataaaggtt taaagctgaa gaggttgaag ctaaaaggaa aaggttgttg ttaatgaata 24 60 
tcaggctatt atttattgta ttaggaaaat ataatattta ctgttagaat tcttttattt 2520 
agggcctttt ctgtgccaga cattgctctc agtgctttgc atgtattagc tcactgaatc 2580 
ttcacgacaa tgttgagaag ttcccattat tatttctgtt cttacaaatg tgaaacggaa 2640 
gctcatagag gtgagaaaac tcaaccagag tcacccagtt ggtgactggg aaagttagga 2700 
ttcagatcga aattggactg tctttataac ccatattttc cccctgtttt tagagcttcc 27 60 
aaatgtgtca gaataggaaa acattgcaat aaatggcttg attttttaaa aaaaaaaaaa 2820 
aaaaaaaaaa 2830 

<210> 387 

<211> 2785 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003607 
<400> 387 

tccgcaagcc tccgcctctg tgcgcgggac tggaggagcc tcgctgagcc cagggcgcga 60 
cggcgagagg agagggaagg cgggggaggg ctggaaggga gaggaaggga gtggttggga 120 
gccgggctgc cgcagcctct agtctcctca gccgcggaag cacccctcct cctgcgccgc 180 
ggccgcctcc ctcctcgctg tggaaagatg cccttagccc aggggtgtga agaaggggga 240 
gaagtagctg ccagagccgc ccgcgccgcc gccgcagctg ctgctcgttg tgtcctttga 300 
attcagagaa gcaccccccc gcctggggcg tcgggagcct ccgggcggcg gccgcggtgc 360 
ggtgtcgggg agaccgggct ctctgcccgc gcggcgcggc gcggctcggo ccacgagcga 420 
ccaccgacat ggagtgggct cgggcggcca agtagccgct tctccggagc cggtgccagt 4 80 
gccgcccgca gcccgccttc cacccccggc cgcgccgccg gtcaggccct agggtgaagc 540 
cgggaggaaa atgaagagtt ttcacccgga atccgttgaa aataggactg actgcaaagc 600 
cttaaagaaa gaaggacctc gggaggagaa acgaaaagcc gcctccgggc aagacttggc 660 
gtgctccgag ccgaggggct gcttcaggga cctcgccccc tccctttccc gctggagaaa 720 
ttgccgctga tgcattatcc aagtggtggt tgggaggatt tgcagcaaca tttttggttt 780 
tccctccccc ttctatgcat tctgtttttt tcctcccttt tctgtttttc ttcttcccgg 84 0 
gaagtgaatt gctgatgcaa atcggacttt attcattaat gatgcaaccg gattcgtttc 900 
aggattacgt tgcacgagtt gaattttgaa tgaaggagaa gagttttttt ttttttttaa 960 
agaagtgttg actctctagt tcgttgtact tttaattatt attttattta aatatacgac 1020 
ttaattgtat tcttttaaaa atgcattaag tatatatttt atggtaattt accctcaaaa 1080 
tatatgtata tgggtgaaat tgaagacgcc ttcagttaag tgaggttact ggtgtgttgg 1140 
atgtttaatt cagcaccagc attgcatgac agttgtttga ataacaagtg gtttattttt 1200 
aaaaccatac cttttaaaat ttaggttcag ataatagtaa aagtcatcat aataatttaa 1260 
aggaaaacca gcagaaatcg aagcaaacat gtctggagaa gtgcgtttga ggcagttgga 1320 
gcagtttatt ttggacgggc ccgctcagac caatgggcag tacttcagtg tggagacatt 1380 
actggatata ctcatctgcc tttatgatga atgcaataat tctccattga gaagagagaa 14 4 0 
gaacattctc gaatacctag aatgggctaa accatttact tctaaagtga aacaaatgcg 1500 
attacataga gaagactttg aaatattaaa ggtgattggt cgaggagctt ttggggaggt 15 60 
tgctgtagta aaactaaaaa atgcagataa agtgtttgcc atgaaaatat tgaataaatg 1620 
ggaaatgctg aaaagagctg agacagcatg ttttcgtgaa gaaagggatg tattagtgaa 1680 
tggagacaat aaatggatta caaccttgca ctatgctttc caggatgaca ataacttata 1740 
cctggttatg gattattatg ttggtgggga tttgcttact ctactcagca aatttgaaga 1800 
tagattgcct gaagatatgg ctagatttta cttggctgag atggtgatag caattgactc 1860 
agttcatcag ctacattatg tacacagaga cattaaacct gacaatatac tgatggatat 1920 
gaatggacat attcggttag cagattttgg ttcttgtctg aagctgatgg aagatggaac 1980 
ggttcagtcc tcagtggctg taggaactcc agattatatc tctcctgaaa tccttcaagc 2040 
catggaagat ggaaaaggga gatatggacc tgaatgtgac tggtggtctt tgggggtctg 2100 
tatgtatgaa atgctttacg gagaaacacc attttatgca gaatcgctgg tggagacata 2160 
cggaaaaatc atgaaccaca aagagaggtt tcagtttcca gcccaagtga ctgatgtgtc 2220 
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tgaaaatgct aaggatctta ttcgaaggct catttgtagc agagaacatc gacttggtca 2280 

aaatggaata gaagacttta agaaacaccc atttttcagt ggaattgatt gggataatat 234 0 

tcggaactgt gaagcacctt atattccaga agttagtagc ccaacagata catcgaattt 2400 

tgatgtagat gatgattgtt taaaaaattc tgaaacgatg cccccaccaa cacatactgc 2460 

attttctggc caccatctgc catttgttgg ttttacatat actagtagct gtgtactttc 2520 

tgatcggagc tgtttaagag ttacggctgg tcccacctca ctggatcttg atgttaatgt 2580 

tcagaggact ctagacaaca acttagcaac tgaagcttat gaaagaagaa ttaagcgcct 264 0 

tgagcaagaa aaacttgaac tcagtagaaa acttcaagag tcaacacaga ctgtccaagc 2700 

tctgcagtat tcaactgttg atggtccact aacagcaagc aaagatttag aaataaaaaa 2760 

cttaaaagaa gaaatttgaa aaaaa 2785 

<210> 388 
<211> 1912 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> AF073519 
<400> 388 

ccggcggctg ttgtcgggcc tccagcgggc ggggccgttg gcggagcaga gcggaggcgc 60 
agccgggcgg agggcccacg agggctcagc cttcccggtc agcggtggtg acggtatccc 120 
agagtgccag agaaccgttg cttttccgag ttgctcttct tccaggctcc gttggtggtc 180 
ggcatggccc gtggaaatca acgagaactt gcccgccaga aaaacatgaa gaaaacccag 240 
gaaattagca agggaaagag gaaagaggat agcttgactg cctctcagag aaagcagagt 300 
tctggaggcc agaaatctga gagcaagatg tcagctgggc cacacctccc tctgaaggct 360 
ccaagggaga atccttgctt tcctcttcca gctgctggtg gctccaggta ttacttggct 420 
tatggcagca taactcctat ctctgccttt gtctttgtgg tcttcttttc tgtcttcttc 480 
ccttcttttt atgaggactt ttgctgttgg atttaggttc cattctaacc taggatgatc 540 
tcatttggaa atccttaatt tcatctacaa aaactgtttt cccaaatagg tcacattcac 600 
gcatatcaga tggacagatg tatcattttg gggtccacca ttcaacccac tacaaggagt 660 
tttttaaaca aaaataggaa acttagatgt aacttagcac tttttttttt tttttttgag 720 
atggagtctc actctgtcac cagactggag tgcagtggcg ccatctcagc tccatgcaac 780 
ctctgcctcc tgggttcaag cagttctctt gcctcagcct cctgggtagc tgggattaca 840 
ggcacgcgct gccacaccca ggtaatttat ttattttttt tttgagacag agtctcgcac 900 
tgttgcccag gctggactgc agtggcgtga tctctgctca ctgcaacctc cgcctcccgg 960 
gttcaagcga ttctccagcc tcagcttcct gagtagatgg gattacaggc gcctgccacc 1020 
acgcccagct aatttttttg tattcttagt agagatgggg tttcaccatg ttggccaggc 1080 
tggtctccat ctcctgacct cgtgattcac ccgcctcggc ctcccaaagt gctgggatta 1140 
caggcgtgag tcacagcccc cggccataat ttagcacttt aaaaaataat agccatgttg 1200 
ggccaggcgt ggtggctcat gcctgtaatc tgagcacttt gggagaccaa ggcgggtaga 1260 
tcccttgtgc ccaggagttc aagaccagcc tgggcaacat ggcgaaaccc catttctact 1320 
aaaaatacaa aaattagctg gggcgagggg ataggccgag ttccgggtgt aagggggcca 1380 
ttagggagag cagagcgagg cagctgatct tccggattgg gggccttgcc cggaagctgg 1440 
acctcacgga gatgaaacgg aagatgcacg aggatatgat ctccatacag aactttctca 1500 
tctacgtggc cctgctgcga gtcactccat ttatcttaaa gaaattggac agcatatgaa 1560 
gattggacat cacatgtgaa tgcatgatat gaagagcctg gttacagttt ctactgttct 1620 
ctgcaagtaa ataggcccag aaaggtataa gagactcttt gaatggacat aaaaattctg 1680 
cttgttaaga acaagttgag ctctggtaac tgatcttaat agctaaaata taaaaatatt 1740 
tgggaagtct gaaatgaggt ctcctggccc tggtgtgccc ttaatgcctg tgacagttgg 1800 
cctctgtgaa tattggtata attgtaaata atgtcaaact ccattttcta gcaagtatta 18 60 
ataattaagg gaagtatgtc tgaaatggca aaaaaaaaaa aaaaaaaaaa aa 1912 

<210> 389 

<211> 1266 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000849 
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<400> 389 

ctcggaagcc cgtcaccatg tcgtgcgagt cgtctatggt tctcgggtac tgggatattc 60 
gtgggctggc gcacgccatc cgcctgctcc tggagttcac ggatacctct tatgaggaga 120 
aacggtacac gtgcggggaa gctcctgact atgatcgaag ccaatggctg gatgtgaaat 180 
tcaagctaga cctggacttt cctaatctgc cctacctcct ggatgggaag aacaagatca 240 
cccagagcaa tgccatcttg cgctacatcg ctcgcaagca caacatgtgt ggtgagactg 300 
aagaagaaaa gattcgagtg gacatcatag agaaccaagt aatggatttc cgcacacaac 360 
tgataaggct ctgttacagc tctgaccacg aaaaactgaa g.cctcagtac ttggaagagc 420 
tacctggaca actgaaacaa ttctccatgt ttctgtggaa attctcatgg tttgccgggg 480 
aaaagctcac ctttgtggat tttctcacct atgatatctt ggatcagaac cgtatatttg 540 
accccaagtg cctggatgag ttcccaaacc tgaaggcttt catgtgccgt tttgaggctt 600 
tggagaaaat cgctgcctac ttacagtctg atcagttctg caagatgccc atcaacaaca 660 
agatggccca- gtggggcaac aagcctgtat gctgagcagg aggcagactt gcagagcttg 720 
ttttgtttca tcctgtccgt aaggggtcag cgctcttgct ttgctctttt caatgaatag 780 
cacttatgtt actggtgtcc agctgagttt ctcttgggta taaaggctaa aagggaaaaa 840 
ggatatgtgg agaatcatca agatatgaat tgaatcgctg cgatactgtg gcatttccct 900 
actccccaac tgagttcaag ggctgtaggt tcatgcccaa gccctgagag tgggtactag 960 
aaaaaacgag attgcacagt tggagagagc aggtgtgtta aatggactgg agtccctgtg 1020 
aagactgggt gaggataaca caagtaaaac tgtggtactg atggacttaa ccggagttcg 1080 
gaaaccgtcc tgtgtacaca tgggagttta gtgtgataaa ggcagtattt cagactggtg 1140 
ggctagccaa tagagttggc aattgcttat tgaaactcat taaaaataat agagccccac 1200 
ttgacactat tcactaaaat taatctggaa tttaaggccc aacattaaac acaaagctgt 1260 
attgat 1266 

<210> 390 
<211> 517 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig32185_RC 
<400> 390 

ttaattagag tttgtagtat acacttaact ttctgtaatc tgcagtgatg aatctctatg 60 

taaacattca gaaaaagagc gaatactgtg tcatgactta tgaactataa attttgtccc 120 

tgattactag tcaagaacag ttttatacat ttattaacaa ctactttcct ccacttaaca 180 

acagctctta aataacaaaa taaattatca gcatttccaa ccacaaattt aagaattaaa 240 

gtttaaaaaa ctggttagtt tcataaatgc aagcagagta ataaatttca tatatgttaa 300 

ttgcagttac ttatgttctt tgagtgaagt aacacaaaga tttttccctt ccccttttgg 360 

caagttcaca aaattcatga aaagtcttca cctttttctt ttttgcttaa tttctttttt 420 

gcatctcagc tgaacacaac agacaatagc tggggttatt agcgagacat gataaaagaa 480 

aaggttatga ccactcagac gatcatgtat acctaaa 517 

<210> 391 
<211> 1266 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_016577 
<400> 391 

gcgcgccgtc cctgcgcccc gctccgcccg cgcctctctc ccagcgccgc gcctcgggcg 60 
taaagagcgc gcccctcgca ccgcagccag tgccgaccgc agcagcccag ccccagcctt 120 
cctccgcctc cgcctctctc tcctcctcct cctccgccgc cggacacgca gccgcagccg 180 
ggaccgggac gcagctgggg agtcagggac gcgcgcagcc agcccttccc cctccggctc 240 
ccgcaccgcc ggccgcctcc cctcgccctc ctactctccc ctccctgctc cttcgctttt 300 
tcctcctcct cctctcccgg ccccggctgc cagcaccatg tccgcagggg gagattttgg 360 
gaatccactg agaaaattca agttggtgtt cttgggggag cagagcgtcg ggaagacgtc 420 
tctgattacg aggttcatgt acgacagctt cgacaacaca taccaggcaa ccattgggat 480 
tgacttcttg tcaaaaacca tgtacttgga ggaccgcacg gtgcgactgc agctctggga 540 
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cacagctggt caggagaggt tccgcagcct 
ggctgtggtg gtgtacgaca tcacaaatct 
cgacgacgtc aggacagaga ggggcagtga 
ggacctggct gataagaggc agataaccat 
gagcgtcatg ttcattgaga ccagtgcgaa 
acgtgtggcg tcggctctac ccggaatgga 
gatcgacatc aagctggaca aaccccagga 
ctaatgcaga gccgacctgt ggcttcccat 
ttggctagct tcctaagggg ggagggaacc 
ctctctgtct ttaggagtag ggtgggatgg 
actcttaacg gctgttactt aaacaactat 
tattaagatt gccaaaaact gttaaaattt 
ttttgc 



gatccccagc tacatccggg actccacggt 600 
caactccttc caacagacct ctaagtggat 660 
tgttatcatc atgctggtgg gcaacaagac 720 
cgaggagggg gagcagcgcg ccaaagaact 780 
gactggctac aacgtgaagc agctttttcg 840 
gaatgtccag gagaaaagca aagaagggat 900 
gcccccggcc agcgagggcg gctgctcctg 960 
gacactcctt gcttgttgtg ttgcttccta 1020 
gagttatcaa gatgggagga tttttctttt 1080 
ggagggaggc tgggcatcag ggatcacatc 1140 
tttttggttt ggttgtaata tattgtactt 1200 
aaaaaaaatt taaatcatgt gtatacaatt 1260 
1266 



<210> 392 
<211> 472 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig48328_RC 



<400> 392 

gtacttttat tgaaaaggta catttaaaaa 
gcagatatag atgctctaaa agagtccact 
cctgatatca caaacattcc agtcttgttg 
atgacctgca atatattcag cgaacagaaa 
cttctgtaca ttattgcatt agggagccac 
caggttaaaa atgaagaaga . tacttatata 
aatgcacaag aggtgcaaaa atgtgcaatt 
tgcttagcaa cacaaaccag tgaggaagct 



aatacacaga cattttacca tttacaggtt 60 
ctattttgtt gttctatgat aactcttgcc 120 
atatcggctt agaaaggggg gcatgggagc 180 
gacaaattgt tcaattataa atttttttat 240 
aaaattatgt agcatcatta caaatgaaaa 300 
gaaatacatg gattcattgt cttcttgcag 360 
taggaagctc tttttctgtt tgtatacgtt 420 
acaaaataag ttaaacaaaa at 472 



<210> 393 
<211> 431 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig46223_RC 



<400> 393 

cttgtccaag atcatttatt gttattatta 
tggctcctca cacacatcct ctctggtggg 
cttgtaccca cctgggcccc cggccagcag 
cctgtccggg ctcgcctgcc ccagtccctg 
ctctgcctgc accctgagca tccccaggac 
gaagtgcctg agatggttca tgaaatgccc 
acgcctccct cctgctgtcc cacacagggc 
aggaaaaatt c 

<210> 394 

<211> 1728 

<212> DNA 

<213> Homo sapiens 



tttttctctc ttaattggat tcagcgtttc 60 
ggattcaggt ttgtgcaaag cacaaaggag 120 
cacggccggc tttgatcgga gcccctcttc 180 
cttctgaccc ctgagtccgg cctcccgagc 240 
ggctgggagg gccaggtctt ccgtgatgca 300 
tcattgttcc ccttctgcta aggccccgcg 360 
ccttgctggg tcacacacca aacagcctca 420 
431 



<300> 

<308> NM_015984 



<400> 394 

cgggaagtag gcggggtgac gtgtggttga cgagctcggc ggcgggtttg ctgagatctg 60 
tggccggcgg cagctggtgc ggggggcagc tgagagcgag aggtggatcg gggcggtgtg 120 
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tggccagggc catgacgggc aacgccgggg agtggtgcct catggaaagc gaccccgggg 180 
tcttcaccga gctcattaaa ggattcggtt gccgaggagc ccaagtagaa gaaatatgga 24 0 
gtttagagcc tgagaatttt gaaaaattaa agccagttca tgggttaatt tttcttttca 300 
agtggcagcc aggagaagaa ccagcaggct ctgtggttca ggactcccga cttgacacga 360 
tattttttgc taagcaggta attaataatg cttgtgctac tcaagccata gtgagtgtgt 420 
tactgaactg tacccaccag gatgtccatt taggcgagac attatcagag tttaaagaat 4 80 
tttcacaaag ttttgatgca gctatgaaag gcttggcact gagcaattca gatgtgattc 540 
gacaagtaca caacagtttc gccagacagc aaatgtttga atttgatacg aagacatcag 600 
caaaagaaga agatgctttt cactttgtca gttatgttcc tgttaatggg agactgtatg 660 
aattagatgg attaagagaa ggaccgattg atttaggtgc atgcaatcaa gatgattgga 720 
tcagtgcagt aaggcctgtc atagaaaaaa ggatacaaaa gtacagtgaa ggtgaaattc 780 
gatttaattt aatggccatt gtgtctgaca gaaaaatgat atatgagcag aagatagcag 840 
agttacaaag acaacttgca gaggaggaac ccatggatac agatcaaggt aatagtatgt 900 
taagtgctat tcagtcagaa gttgccaaaa atcagatgct tattgaagaa gaagtacaga 960 
aattaaaaag atacaagatt gagaatatca gaaggaagca taattatctg cctttcatta 1020 
tggaattgtt aaagacttta gcagaacacc agcagttaat accactagta gaaaaggcaa 1080 
aagaaaaaca gaacgcaaag aaagctcagg aaaccaaatg aagatgtttt cagatatgta 1140 
cacatttctg cttctgcaca tattttcatg gaaaccatta tgtataaaga acttagagca 1200 
acatcctaat tggctcagtg cacgtttggc aatagtgcca gcctgtcttg tctttaatgc 1260 
atggattcat aaacttcttc cctacctgca tcatgtgcat gtagtgcata ttaaatgaaa 1320 
gtgatattaa gaatgctttc ccaaattcca ttatttgaca ttgagtctga caactgttag 1380 
ttttctggtt gtccaactac catatgaagc tagaaaatgc acaaacgata ttccttatct 1440 
gtaatttaaa tacttaaaat ttgcaattgt cagatcttga ttaaactggt tgtcttattt 1500 
cttctcatca ttaacggaaa aaaaatcagt atttctatct ttgatatcta agtgttttga 1560 
ggattttaaa actgaatttt atctgctata ccagttattt gagaaagtat gattttaatg 1620 
taaatcattt aaaaaggaca aaagtataat ttccagtgat tttcactgct gtcagtagaa 1680 
aagtaataaa catctcaatt ttaaaaaaaa aaaaaaaaaa aaaaaaaa 1728 

<210> 395 

<211> 1348 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006117 
<400> 395 

caggtggcgt acttggcttg gagactggcg cggcgttcgt gtccgagttc tctgcaggtc 60 
actagtttcc cggtagttca gctgcacatg aatagaacag caatgagagc cagtcagaag 120 
gactttgaaa attcaatgaa tcaagtgaaa ctcttgaaaa aggatccagg aaacgaagtg 180 
aagctaaaac tctacgcgct atataagcag gccactgaag gaccttgtaa catgcccaaa 240 
ccaggtgtat ttgacttgat caacaaggcc aaatgggacg catggaatgc ccttggcagc 300 
ctgcccaagg aagctgccag gcagaactat gtggatttgg tgtccagttt gagtccttca 360 
ttggaatcct ctagtcaggt ggagcctgga acagacagga aatcaactgg gtttgaaact 420 
ctggtggtga cctccgaaga tggcatcaca aagatcatgt tcaaccggcc caaaaagaaa 480 
aatgccataa acactgagat gtatcatgaa attatgcgtg cacttaaagc tgccagcaag 54 0 
gatgactcaa tcatcactgt tttaacagga aatggtgact attacagtag tgggaatgat 600 
ctgactaact tcactgatat tccccctggt ggagtagagg agaaagctaa aaataatgcc 660 
gttttactga gggaatttgt gggctgtttt atagattttc ctaagcctct gattgcagtg 720 
gtcaatggtc cagctgtggg catctccgtc accctccttg ggctattcga tgccgtgtat 780 
gcatctgaca gggcaacatt tcatacacca tttagtcacc taggccaaag tccggaagga 840 
tgctcctctt acacttttcc gaagataatg agcccagcca aggcaacaga gatgcttatt 900 
tttggaaaga agttaacagc gggagaggca tgtgctcaag gacttgttac tgaagttttc 960 
cctgatagca cttttcagaa agaagtctgg accaggctga aggcatttgc aaagcttccc 1020 
ccaaatgcct tgagaatttc aaaagaggta atcaggaaaa gagagagaga aaaactacac 1080 
gctgttaatg ctgaagaatg caatgtcctt cagggaagat ggctatcaga tgaatgcaca 1140 
aatgctgtgg tgaacttctt atccagaaaa tcaaaactgt gatgaccact acagcagagt 1200 
aaagcatgtc caaggaagga tgtgctgtta cctctgattt ccagtactgg aactaaataa 1260 
gcttcattgt gccttttgta gtgctagaat atcaattaca atgatgatat ttcactacag 1320 
ctctgatgaa taaaaagttt tgtaaaac 1348 
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<210> 396 
<211> 3106 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> AK000745 
<400> 396 

cggccgcggc ccccgacgac ctggccttgc tgaagaatct ccggagcgag gaacagaaga 60 

agaagaaccg gaagaaactg tccgagaagc ccaaaccaaa tgggcggact gttgaagtgg 120 

ctgagggtga agctgttcga acacctcaaa gtgtaacagc aaagcagcca ccagagattg 180 

acaagaaaaa tgaaaagtca aagaaaaata agaagaaatc aaagtcagat gctaaagcag 240 

tgcaaaacag ttcacgccat gatggaaagg aagttgatga aggagcctgg gaaactaaaa 300 

ttagtcacag agagaaacga cagcagcgta aacgtgataa ggtgctgact gattctggtt 360 

cattggattc aactatccct gggatagaaa ataccatcac agttaccacc gagcaactta 420 

caaccgcatc atttcctgtt ggttccaaga agaataaagg tgattctcat ctaaatgttc 480 

aagttagcaa ctttaaatct ggaaaaggag attctacact tcaggtttct tcaggattga 540 

atgaaaacct cactgtcaat ggaggaggct ggaatgaaaa gtctgtaaaa ctctcctcac 600 

agatcagtgc aggtgaggag aagtggaact ccgtttcacc tgcttctgca ggaaagagga 660 

aaactgagcc atctgcctgg agtcaagaca ctggagatgc taatacaaat ggaaaagact 720 

ggggaaggag ttggagtgac cgttcaatat tttctggcat tgggtctact gctgagccag 780 
tttctcagtc taccacttct gattatcagt gggatgttag ccgtaatcaa ccctatatcg 840 
atgatgaatg gtctgggtta aatggtctgt cttctgctga tcccaactct gattggaatg 900 
caccagcaga agagtggggc aattgggtag acgaagaaag agcttcactt ctaaagtccc 960 
aggaaccaat tcctgatgat caaaaggtct cagatgatga taaagaaaag ggagagggag 1020 
ctcttccaac tgggaaatcc aaaaagaaaa aaaagaaaaa gaagaagcaa ggtgaagata 1080 
actctactgc acaggacaca gaagaattag aaaaagagat tagagaagac cttccagtga 1140 
atacctctaa aacccgtcca aaacaggaaa aagctttttc cttgaagacc ataagcacta 1200 
gtgatccagc cgaagtactc gtcaaaaata gccagcctat caagactctt ccacctgcta 1260 
cttctaccga gccatctgta atcttatcaa aaagtgattc tgacaagagc tcttcccaag 1320 
tgccgccaat actacaagag acagataaat ccaagtcaaa taccaagcaa aatagtgtgc 1380 
ctccttcaca gaccaagtct gaaactagct gggaatctcc caaacaaata aaaaagaaga 1440 
aaaaagccag acgagaaacg tgaaattttt tttcctgaat tggacatgtg tttgcaaaca 1500 
cttgtcttga agattatgct gtttatgcaa taatttgtga acatgtacag agttttatat 1560 
aaatttaaac caatttttaa aacaaaactg cggacaccac cataaaaatg gaatcaaaag 1620 
aaagttaatt tatgaaatta agaggtcagc agaatatact cagtgatgga agacacttgg 1680 
gaaagtcttt ttaatagaac aagaacgatc ttaatttaag aatattatcc tggtttaaca 1740 
acagtgccct gtttacaaca gattgtgccc tatctcatct gcagccgagg aataaaggat 1800 
tctgattaga aagagggttg cctacagatt agtaagcaat tccttggatc ttatgcacag 1860 
aacttgtacc atttgaatct gttttatgct taaatcaaag tgctttgatc aaatgcataa 1920 
cctgccatat ctttacatat ttgttggtag caatttgtat taaagaaatc acaagtgoaa 1980 
ataaaaagtc atttatcatt tgtttaacta aactgtcatg gtttagttta caatttttaa 2040 
aaagttctta aaatactgaa aatgcagttg acacttgtgt atggcttatg aagttatttt 2100 
tgatagtctt acattacttg aattgttcaa agtacagtat attttaaatt aagaaaagtg 2160 
aactatatgt atttgtttta tacatttaag gcttagactc ataaataatg ctattgttta 2220 
tgatttgaaa actttcaggc aaaatccaat ttacattttt cccttcccta gcaattactt 2280 
ttttccagct tcaactcttc ttagttacta atactttgtt gactttaaaa atgaaatcat 2340 
tcacaaactt ttggtatatg atggagaatg aaaaactaga gtcagacagc tttaattgac 2400 
attgtcaaca cctccagtta tcaggaatac atttttttac tgccttaacc tgtagtgcgt 24 60 
agaatatgca tcaatttctt gaaggagatt catgttttta taagaatttt catgtaatta 2520 
ttgcaattgt ggtcaaataa ggaacgtttc ctgcttgaaa ttatattgat ttaaatgatg 2580 
tgtgagatgt ttcaccattt tcaggcactg tgtaattcta ttgtaataaa ctggcaggta 2640 
tctttgtaac tataaatagt gcatgctcag ccatgtacac tgtaaatagc ctttaccaaa 2700 
cgtgtttgac aaggaccata attaacatca cttagtgaat tgtgataaag aaaaaaaagc 2760 
catgatttat tcgatgtgat tggcttgttt ttatgtggcg ccaagaacga acctgtttaa 2820 
cagctgtaac caatggtact gatctatcca tccaatgttg tcattatatt tgactgtggt 2880 
tcaacagtat tgcgttgtca gactaggaaa gctaaacgaa caaaatggtt ttagttttgc 2940 
tgaagactgg ccttattaat ggacagcttt cctaacaaga gattattaac ttttatcagg 3000 
tgttaacatc tgtttcagga acatggcagt atgtttacat gtcagaagtt ttgtttaatt 3060 
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ctatggtatt tctaaattga cttgtttaaa taaattcagc aaatgg 



3106 . 



<210> 397 

<211> 556 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig40831_RC 
<400> 397 

cttgttttat catgtaattt tcagaaaata tgtgatccat aaagattaaa agaaagttgt 60 

attaagtctg gcagctttag tattaacttg aaataaaata tggcaagctt tccacgtcct 120 

cctttatttc cacaatccat atgtacgagc tagattccag tcagaacttc cacaaatact 180 

tcactctttg gtagcagcgg ttataaatta cgcctttgct aatttgcgtt gttcccaacc 240 

aggagaaaca ttaccacaaa aaaagtcagt ttcatcctgc agtgttcccg cagcaaccat 300 

attaagctgg aagaataaag ctcctttgta gtagtagcgt ctattgaatt acaatgtaaa 3 60 

tgttgatgcc tcccctggaa acgtcaaatg acaacaggtt tacagaacga attcagtggt 420 

tgtttgcagc tctggaatcc caggctgcag agttgagatc gaagaggtgc gtctggccat 480 

cttagcgttc attcaaccaa atctcctcac tccatcaaaa ccacgccact ccccacctgc 54 0 

ccagctcgta aaagga ■ 55 g 

<210> 398 
<211> 893 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_003239 
<400> 398 

gagttttagc aggaggggat atgcatgtac acatatgcat atacatacag gtttatttaa 60 
gtgattggat ctacactttt ggaggctatt tgaagtatct tatcaaaata aaacgcattt 120 
taagcaagag gttaaaaaag cttttcagac ocgaaggcca tttaatacag taaatcttac 180 
ttgggtagtt tagcaaacat tttttaaaat ccacatccaa cagattggtt agtattagat 240 
acacagattt ggtttcaaag ctgaaatacc cctggtggaa taaatcttct catgaaattc 300 
agttatatta tgccacaaaa cagggacact tatctattga caacttaaat attaactcag 360 
taggctacct ggtgatataa ataggaaaaa aacagtggtc atggattagg agtgagaaga 420 
ctatgacacc agcctacctt ggactgggac cgcaaaggat gttaaaatct atcctagaat 4 80 
ttaaaacagg aaaaatgtca aatattccac tccccattcg gcccaaacca tcccccaagg 54 0 
aatacacccc caccccaccc ccacatgagg tttactaacc agctgctaca gatgttacta 600 
tgtcactaat taacaatgat cacagaaaat gtgtcatacc agagtttttg ttatcataca 660 
caatgcatta gtggctgcag gaatcttcct ccaaaagcta ccgtcagaag gcaaactgac 720 
aaatgtccat gccgctggtc caagaaacca cagctaagag ccatgaggca ttcatttgta 780 
cattgagctc tctctggcat gctccccact ctagttagag gggctggggt atggagagac 84 0 
cgtcccagcc gggcgctgca gcctcctact cctgggcctc ttgcgggtca act 893 

<210> 399 
<211> 2470 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_014791 
<400> 399 

ttggcgggcg gaagcggcca caacccggcg atcgaaaaga ttcttaggaa cgccgtacca 60 
gccgcgtctc tcaggacagc aggcccctgt ccttctgtcg ggcgccgctc agccgtgccc 120 
tccgcccctc aggttctttt tctaattcca aataaacttg caagaggact atgaaagatt 180 
atgatgaact tctcaaatat tatgaattac atgaaactat tgggacaggt ggctttgcaa 240 
aggtcaaact tgcctgccat atccttactg gagagatggt agctataaaa atcatggata 300 
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aaaacacact agggagtgat ttgccccgga tcaaaacgga gattgaggcc ttgaagaacc 360 
tgagacatca gcatatatgt caactctacc atgtgctaga gacagccaac aaaatattca 420 
tggttcttga gtactgccct ggaggagagc tgtttgacta tataatttcc caggatcgcc 480 
tgtcagaaga ggagacccgg gttgtcttcc gtcagatagt atctgctgtt gcttatgtgc 540 
acagccaggg ctatgctcac agggacctca agccagaaaa tttgctgttt gatgaatatc 600 
ataaattaaa gctgattgac tttggtctct gtgcaaaacc caagggtaac aaggattacc 660 
atctacaga'c atgctgtggg agtctggctt atgcagcacc tgagttaata caaggcaaat 720 
catatcttgg atcagaggca gatgtttgga gcatgggcat actgttatat gttcttatgt 780 
gtggatttct accatttgat gatgataatg taatggcttt atacaagaag attatgagag 84 0 
gaaaatatga tgttcccaag tggctctctc ccagtagcat tctgcttctt caacaaatgc 900 
tgcaggtgga cccaaagaaa cggatttcta tgaaaaatct attgaaccat ccctggatca 960 
tgcaagatta caactatcct gttgagtggc aaagcaagaa tccttttatt cacctcgatg 1020 

atgattgcgt aacagaactt tctgtacatc acagaaacaa caggcaaaca atggaggatt 1080 
taatttcact gtggcagtat gatcacctca cggctaccta tcttctgctt ctagccaaga 1140 
aggctcgggg aaaaccagtt cgtttaaggc tttcttcttt ctcctgtgga caagccagtg 1200 
ctaccccatt cacagacatc aagtcaaata attggagtct ggaagatgtg accgcaagtg 1260 
ataaaaatta tgtggcggga ttaatagact atgattggtg tgaagatgat ttatcaacag 1320 
gtgctgctac tccccgaaca tcacagttta ccaagtactg gacagaatca aatggggtgg 1380 
aatctaaatc attaactcca gccttatgca gaacacctgc aaataaatta aagaacaaag 1440 
aaaatgtata tactcctaag tctgctgtaa agaatgaaga gtactttatg tttcctgagc 1500 
caaagactcc agttaataag aaccagcata agagagaaat actcactacg ccaaatcgtt 1560 
acactacacc ctcaaaagct agaaaccagt gcctgaaaga aactccaatt aaaataccag 1620 
taaattcaac aggaacagac aagttaatga caggtgtcat tagccctgag aggcggtgcc 1680 
gctcagtgga attggatctc aaccaagcac atatggagga gactccaaaa agaaagggag 1740 
ccaaagtgtt tgggagcctt gaaagggggt tggataaggt tatcactgtg ctcaccagga 1800 
gcaaaaggaa gggttctgcc agagacgggc ccagaagact aaagcttcac tataatgtga 1860 
ctacaactag attagtgaat ccagatcaac tgttgaatga aataatgtct attcttccaa 1920 
agaagcatgt tgactttgta caaaagggtt atacactgaa gtgtcaaaca cagtcagatt 1980 
ttgggaaagt gacaatgcaa tttgaattag aagtgtgcca gcttcaaaaa cccgatgtgg 2040 
tgggtatcag gaggcagcgg cttaagggcg atgcctgggt ttacaaaaga ttagtggaag 2100 
acatcctatc tagctgcaag gtataattga tggattcttc catcctgccg gatgagtgtg 2160 
ggtgtgatac agcctacata aagactgtta tgatcgcttt gattttaaag ttcattggaa 2220 
ctaccaactt gtttctaaag agctatctta agaccaatat ctctttgttt ttaaacaaaa 2280 
gatattattt tgtgtatgaa tctaaatcaa gcccatctgt cattatgtta ctgtcttttt 2340 
taatcatgtg gttttgtata ttaataattg ttgactttct tagattcact tccatatgtg 2400 
aatgtaagct cttaactatg tctctttgta atgtgtaatt tctttctgaa ataaaaccat 24 60 
ttgtgaatat 2470 

<210> 400 
<211> 2212 
<212> DNA 
<213> Homo 

<300> 

<308> X05610 
<400> 400 

ggggaacgag gcccacctgg gagcccagga cttcaggggt tcccaggcat cacaccccct 60 

tccaacatct ctggggcacc tggtgacaaa ggggcgccag ggatatttgg cctgaaaggt 120 

tatcggggcc caccagggcc accaggttct gctgctcttc ctggaagcaa aggtgacaca 18 0 

gggaacccag gagctccagg aaccccaggg accaaaggat gggccgggga ctccgggccc 24 0 

cagggcaggc ctggtgtgtt tggtctccca ggagaaaaag ggcccagggg tgaacaaggc 300 

ttcatgggga acactggacc caccggggcg gtgggcgaca gaggccccaa gggacccaag 360 

ggagacccag gattccctgg tgcccccggg actgtgggag cccccgggat tgcaggaatc 420 

ccccagaaga ttgccatcca accagggaca gtgggtcccc aggggaggcg aggcccccct 480 

ggggcaccgg gggagatcgg gccccagggc ccccccggag aaccaggttt tcgtggggct 54 0 

ccagggaaag ctgggcccca aggaagaggt ggtgtgtctg ctgttcccgg cttccgggga 600 

gatgaaggac ccataggcca ccaggggccg attggccaag aaggtgcacc aggccgtcca 660 

gggagcccgg gcctgccggg tatgccaggc cgcagcgtca gcatcggcta cctcctggtg 720 

aagcacagcc agacggacca ggagcccatg tgcccggtgg gcatgaacaa actctggagt 780 
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ggatacagcc tgctgtactt cgagggccag gagaaggcgc acaaccagga cctggggctg 840 
gcgggctcct gcctggcgcg gttcagcacc atgcccttcc tgtactgcaa ccctggtgat 900 
gtctgctact atgccagccg gaacgacaag tcctactggc tctctaccac tgcgccgctg 960 
cccatgatgc ccgtggccga ggacgagatc aagccctaca tcagccgctg ttctgtgtgt 1020 
gaggccccgg ccatcgccat cgcggtccac agtcaggatg tctccatccc acactgccca 1080 
gctgggtggc ggagtttgtg gatcggatat tccttcctca tgcacacggc ggcgggagac 1140 
gaaggcggtg gccaatcact ggtgtcaccg ggcagctgtc tagaggactt ccgcgccaca 1200 
ccattcatcg aatgcaatgg aggccgcggc acctgccact actacgccaa caagtacagc 1260 
ttctggctga ccaccattcc cgagcagagc ttccagggct cgccctccgc cgacacgctc 1320 
aaggccggcc tcatccgcac acacatcagc cgctgccagg tgtgcatgaa gaacctgtga 1380 
gccggcgcgt gccaggaagg gccattttgg tgcttattct taacttatta cctcaggtgc 1440 
caaccaaaaa ttggttttat ttttttctta aaaaaaaaaa aaagtctacc aaaggaattt 1500 
gcatccagca gcagcactta gacctgccag ccactgtcac cgagcgggtg caagcactcg 1560 
gggtccctgg aggcaagccc tgcccacaga aagccaggag cagccctggc ccccatcagc 1620 
cctgctacga cgcaccgcct gaaggcacag ctaaccactt cgcacacacc catgtaacca 1680 
ctgcactttc caatgccaca gacaactcac attgttcaac tccttctcgg ggtgggacag 1740 
acgagacaac agcacacagg cagccagccg tggccagagg ctcgaggggc tcaggggctc 1800 
aggcacccgt ccccacacga gggccccgtg ggtggcctgg ccctgctttc tacgccaatg 18 60 
ttatgccagc tccatgttct cccaaatacc gttgatgtga attattttaa aggcaaaact 1920 
gtgctcttta ttttaaaaaa cactgataat cacactgcgg taggtcattc ttttgccaca 1980 
tccctataga ccactgggtt tggcaaaact caggcagaag tggagacctt tctagacatc 204 0 
attgtcagcc ttgctacttg aaggtacacc ccatagggtc ggaggtgctg tccccactgc 2100 
cccaccttgt ccctgagatt taacccctcc actgctgggg gtgagctgta ctcttctgac 2160 
tgccccctcc tgtgtaacga ctacaaaata aaacttggtt ctgaatattt tt 2212 

<210> 401 
<211> 4221 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_016448 
<400> 401 

cgataacgat ttgtgttgtg agaggcgcaa gctgcgattt ctgctgaact tggaggcatt 60 
tctacgactt ttctctcagc tgaggctttt cctccgaccc tgatgctctt caattcggtg 120 
ctccgccagc cccagcttgg cgtcctgaga aatggatggt cttcacaata ccctcttcaa 180 
tcccttctga ctggttatca gtgcagtggt aatgatgaac acacttctta tggagaaaca 240 
ggagtcccag ttcctccttt tggatgtacc ttctcttctg ctcccaatat ggaacatgta 300 
ctagcagttg ccaatgaaga aggctttgtt cgattgtata acacagaatc acaaagtttc 360 
agaaagaagt gcttcaaaga atggatggct cactggaatg ccgtctttga cctggcctgg 420 
gttcctggtg aacttaaact tgttacagca gcaggtgatc aaacagccaa attttgggac 480 
gtaaaagctg gtgagctgat tggaacatgc aaaggtcatc aatgcagcct caagtcagtt 54 0 
gccttttcta agtttgagaa agctgtattc tgtacgggtg gaagagatgg caacattatg 600 
gtctgggata ccaggtgcaa caaaaaagat gggttttata ggcaagtgaa tcaaatcagt 660 
ggagctcaca atacctcaga caagcaaacc ccttcaaaac ccaagaagaa acagaattca 720 
aaaggacttg ctccttctgt ggatttccag caaagtgtta ctgtggtcct ctttcaagac 780 
gagaatacct tagtctcagc aggagctgtg gatgggataa tcaaagtatg ggatttacgt 840 
aagaattata ctgcttatcg acaagaaccc atagcatcca agtctttcct gtacccaggt 900 
agcagcactc gaaaacttgg atattcaagt ctgattttgg attccactgg ctctacttta 960 
tttgctaatt gcacagacga taacatctac atgtttaata tgactgggtt gaagacttct 1020 
ccagtggcta ttttcaatgg acaccagaac tctacctttt atgtaaaatc cagccttagt 1080 
ccagatgacc agtttttagt cagtggctca agtgatgaag ctgcctacat atggaaggtc 1140 
tccacaccct ggcaacctcc tactgtgctc ctgggtcatt ctcaagaggt cacgtctgtg 1200 
tgctggtgtc catctgactt cacaaagatt gctacctgtt ctgatgacaa tacactaaaa 12 60 
atctggcgct tgaatagagg cttagaggag aaaccaggag gtgataaact ttccacggtg 1320 
ggttgggcct ctcagaagaa aaaagagtca agacctggcc tagtaacagt aacgagtagc 1380 
cagagtactc ctgccaaagc ccccagggta aagtgcaatc catccaattc ttccccgtca 14 40 
tccgcagctt gtgccccaag ctgtgctgga gacctccctc ttccttcaaa tactcctacg 1500 
ttctctatta aaacctctcc tgccaaggcc cggtctccca tcaacagaag aggctctgtc 1560 
tcctccgtct ctcccaagcc accttcatct ttcaagatgt cgattagaaa ctgggtgacc 1620 
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cgaacacctt cctcatcacc acccatcact ccacctgctt cggagaccaa gatcatgtct 1680 
ccgagaaaag cccttattcc tgtgagccag aagtcatccc aagcagaggc ttgctctgag 1740 
tctagaaata gagtaaagag gaggctagac tcaagctgtc tggagagtgt gaaacaaaag 1800 
tgtgtgaaga gttgtaactg tgtgactgag cttgatggcc aagttgaaaa tcttcatttg 18 60 
gatctgtgct gccttgctgg taaccaggaa gaccttagta aggactctct aggtcctacc 1920 
aaatcaagca aaattgaagg agctggtacc agtatctcag agcctccgtc tcctatcagt 1980 
ccgtatgctt cagaaagctg tggaacgcta cctcttcctt tgagaccttg tggagaaggg 2040 
tctgaaatgg taggcaaaga gaatagttcc ccagagaata aaaactggtt gttggccatg 2100 
gcagccaaac ggaaggctga gaatccatct ccacgaagtc cgtcatccca gacacccaat 2160 
tccaggagac agagcggaaa gacattgcca agcccggtca ccatcacgcc cagctccatg 2220 
aggaaaatct gcacatactt ccatagaaag tcccaggagg acttctgtgg tcctgaacac 2280 
tcaacagaat tatagattct aatctgagtg agttactgag ctttggtcca ctaaaacaag 2340 
ctgagctttg gtccactaaa acaagatgaa aaatacaaga gtgactctat aactctggtc 24 00 
tttaagaaag ctgccttttc atttttagac aaaatctttt caacgctgaa atgtacctaa 24 60 
tctggttcta ctaccataat gtatatgcag cttcccgagg atgaatgctg tgtttaaatt 2520 
tcataaagta aatttgtcac tctagcattt tgaatgaata gtcttcactt tttaaattat 2580 
tcatcttctc tataataatg acatcccagt tcatggaggc aaaaaacaag tttcttgtta 2640 
tcctgaaact ttctatgctc agtggaaagt atctgccagc cacagcatga ggcctgtgaa 27 00 
ggctgactga gaaatcctct gctgaagacc cctggttctg ttctgcctcc aacatgtata 27 60 
attttatttg aaatacataa tcttttcact atgcttttgt ggggtttttt ttaagtatgt 2820 
gtaaaaatgt gatgctcaga taagtacatt tatatcagtt cagtgttaaa atgcagtctc 2880 
ttgagttaaa gtcatcttta ttttaaatgc agtgataaat gtcaactctt cggagaaact 2940 
aggagaacaa caacagaaag ctgtgtttgt cttttttctc tcaaatatat ctcccgtatg 3000 
agatttcagg tccccatgtt ttcaccaagc aatctgctat gtcagccaac ccaacatcac 30 60 
tttctacagg aggttatgat ttttgccatt tactagagga agatgtttta tgaaatcaat 3120 
ttggggtttg aattcaggtg cagtcatcag ttctttaggg gctgcaatgt tttaaaaaaa 3180 
ataagtcatc agattttaag aaaaaagtga tgatttctta ttgatatttt tgtaacagaa 3240 
tatagctctt aactgaaaat ccagaaccag aaacataaat cttgagtttc ttttcatgta 3300 
cataaaaagc aatagccttt tagtatagat agccctgagc caaaaagtaa tagaattttc 3360 
tctagatatt taatacagag agtgtataga ctgactctaa gttaataatg tgcaaaatat 34 20 
cttaaacatc cctcccctta ttcaacaatt atgtatcagt gatcttgaac cattgtttta 34 80 
tatttttcac ctttgtaacc tcatggaaag aggctttaca tactttctat gtactattta 3540 
cttagaaggg agcccccttc cagtcatgaa acttcatttg ttttatccat atccctgagg 3600 
actgtgtaga ctttatgtca gttctgtgta gactttatgt cagtttttgt cattatttga 3660 
aaatctattc tgacaacttt ttaattcctt tgatcttata agttaaagct gtaacaactg 3720 
aaattgcatg gatcaagtaa gcatagtttt atccagggag aaaaataaaa ggaagccata 3780 
gaattgctct ggtcaaaacc aagcacacca tagccttaac tgaatattta ggaaatctgc 3840 
ctaatctgct tatatttggt gtttgttttt tgactgttgg gctttgggaa gatgttattt 3900 
atgaccaata tctgccagta acgctgttta tctcacttgc tttgaaagcc aatgggggaa 3960 
aaaaatccat gaaaaaaaaa agattgataa agtagatgat tttgtttgta tccctaccca 4020 
tctcctggca gccctactga gtgaaattgg gatacatttg gctgtcagaa attataccga 4080 
gtctactggg tataacatgt ctcacttgga aagctagtac ttttaaatgg gtgccaaagg 4140 
tcaactgtaa tgagataatt atccctgcct gtgtccatgt cagactttga gctgatcctg 4200 
aataataaag ccttttacct t 4221 

<210> 402 

<211> 3224 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NMJD18401 
<400> 402 

gtcccacatc ccgcatccgg 
gaatatgggc gggaaccact 
ctttgaccat tttcagattc 
cgtgcagaag cgagacacta 
catcgagagg gatgaggttc 
gcaccccttc ctggtcaatc 
ggtggacctg ctcctgggag 



catcccagcg gccgggcatg tagcagcggc agcaacggcg 60 

cccacaagcc ccccgtgttt gacgagaatg aggaagtcaa 120 

tgcgggccat tggtaaaggg agttttggaa aggtatgcat 180 

agaaaatgta tgcaatgaag tacatgaaca agcagaagtg 240 

ggaatgtttt ccgggagctg cagatcatgc aagggctgga 300 

tgtggtactc cttccaggat gaggaggaca tgttcatggt 360 

gcgacctgcg ctaccatctg cagcagaatg tgcatttcac 420 
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agaggggact gtgaaactct acatctgtga gctggcactg gccctggagt atcttcagag 4 80 
gtaccacatc atccacagag acatcaagcc agacaatatc ctgctggatg aacacggaca 54 0 
tgttcacatt acagacttca acatagcgac ggtagtgaaa ggagcagaaa gggcttcctc 600 
catggctggc accaagccct acatggctcc agaagtattc caggtgtaca tggacagagg 660 
ccccggatac tcgtaccctg tcgactggtg gtccctgggc atcacagcct atgagctgct 720 
gcggggctgg aggccgtacg aaatccactc ggtcacgccc atcgatgaaa tcctcaacat 780 
gttcaaggtg gagcgtgtcc actactcctc cacgtggtgc aaggggatgg tggccctgct 840 
gaggaagctc ctgaccaagg atcctgagag ccgcgtgtcc agccttcatg acatacagag 900 
cgtgccctac ttggccgaca tgaactggga cgcggtgttc aagaaggcac tgatgcccgg 960 
ctttgtgccc aataaaggga ggttgaactg cgatcccaca tttgagcttg aagagatgat 1020 
tctagaatcc aagccacttc acaaaaagaa gaagcgattg gcaaagaaca gatccaggga 1080 
tggcacaaag gacagctgcc cgctgaatgg acacctgcag cactgtttgg agactgtccg 1140 
ggaggaattc atcatattca acagagagaa gctcaggagg cagcagggac agggcagcca 1200 
gctcttggac accgacagcc gagggggagg ccaggcccaa agcaagctcc aggacgggtg 1260 
caacaacaac ctcctcaccc acacctgcac ccgtggctgc agcagctgag cccacacttg 1320 
ttgctgctca acaggactgc actcgtctct gccctgccca cccagagcco ctctttgtgc 1380 
cctgatggtc cctgtctcac ccctgaaaac atcagatgca gaaaaagccc tggacttgga 1440 
gctgggaagc ctgggttctg gtcccatctc catgactgat tcacgtgtga cctcagacaa 1500 
gtcacgccct ctctgtgcct ccgttttctg catctgccaa aggggttaaa cacttctgcc 1560 
ccacttcaaa ttacaagatt atggggagaa cccaattagg taggaaacat gaaaaacctt 1620 
tgatatttat aaaatcattt ttacgtgcaa aatataacct taatatttga agtgaccccc 1680 
attccccaaa gcaatcaaac cgtcatgact ttgcaatttg gcacatccta gcttgttaga 1740 
gggcacttcc gaaaaacaca gccctgacag caaaataaag gtctgatatg ttggcccctt 1800 
ctatggaaac aacgctgcca aatcctggag caaaacctga agtgtcttca tgtgcattct 1860 
ctggcaggcc acagtcctga gcttgtaaga tggtgcagca tgcagaccag acttgtcccc 1920 
aaggtctcag cgctgcggtc tcactcctcc cctcatttaa gaagactatc cttacctttt 1980 
agtttcagca gtcctcacca ccaccatatc cccagtgctg ggatggcaca caggtgtcca 2040 
ttcagatgag agttgggtcg ctgagcattg gttactcctg cagagtgtaa tcagcacccc 2100 
atccaactgg cccgaaagcc cagacctgca gcagaactct ccaactctct atcagctttc 2160 
agggttttct ctcctgggaa gggtgtaaaa tcagcttgtc agattcttct tacagagagt 2220 
atccaatcgg tattggtgga gcggctccct atttatacaa taggaagcat gggtgcttag 2280 
aaagtttatt tcaggaggaa aatgggttca cacaaaaagc aaactacatt ctgatctgct 2340 
cagggagaag cttgcctttg aactggaaga tgttgggatg agcagggaaa gcttagactt 2400 
tggagtcagg tttgtgttca gaatccagcc ctgctggcta ctaactaact gggagacctt 24 60 
aggcaaagca tgcaatcgct ctgaatggca gtttcctcat ttttaaacag ggataataaa 2520 
actaatattg caggggagtt acagggttaa ataagatcc't gtgtgtaacc ccaagcattg 2580 
gatgactcat agaatggcct tttttgtcag cataatcgtc atcattattt agatactttc 2640 
ttccttcact cacccagcag gtcagttttc tgtgcaaaca aacctgttta ggattcttcc 2700 
aaatgttctt cctggggtct ttgatatttg tttgttacat cctgctgaag ttcgactgtg 2760 
tttttatttt ttcatccaac ttccattttt cactttttac atgattactc aatccttggg 2820 
gctgtccatg tcatctctta gatttcttaa aagacatttt aatgtatggt taggttttat 2880 
atttttattt tttaaaaaag aaatagtcag tgttttcctc ctttcaaccg agactatttc 2940 
tggattgtgt gctcctcgtc agttgacttg ttttgcacac ttttctttac ttcatgtccc 3000 
catcaacaac cgtcctgctc cccacctccc ccaggaaata aggggcctgc tcctctccct 3060 
actgtgaccc tggaggctct taagatgatg atggtttttt ttattgggct gagttcacga 3120 
attaggggca ggagctggaa gtcgccctag gaacaccaga tttcctggtt ctgttcaagt 3180 
tggcatttct tgtttggaat aaactatttc ttggacattc cttc 3224 

<210> 403 

<211> 2460 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000788 
<400> 403 

aaagtcaaac cccgacaccg cggcgggccg gtgagctcac tagctgaccc ggcaggtcag 60 
gatctggctt agcggcgccg cgagctccag tgcgcgcacc cgtggccgcc tcccagccct 120 
ctttgccgga cgagctctgg gccgccacaa gactaaggaa tggccacccc gcccaagaga 180 
agctgcccgt ctttctcagc cagctctgag gggacccgca tcaagaaaat ctccatcgaa 240 



180 



WO 2006/084272 PCT/US2006/004280 



gggaacatcg ctgcagggaa gtcaacattt gtgaatatcc ttaaacaatt gtgtgaagat 300 
tgggaagtgg ttcctgaacc tgttgccaga tggtgcaatg ttcaaagtac tcaagatgaa 360 
tttgaggaac ttacaatgtc tcagaaaaat ggtgggaatg ttcttcagat gatgtatgag 420 
aaacctgaac gatggtcttt taccttccaa acatatgcct gtctcagtcg aataagagct 480 
cagcttgcct ctctgaatgg caagctcaaa gatgcagaga aacctgtatt attttttgaa 540 
cgatctgtgt atagtgacag gtatattttt gcatctaatt tgtatgaatc tgaatgcatg 600 
aatgagacag agtggacaat ttatcaagac tggcatgact ggatgaataa ccaatttggc 660 
caaagccttg aattggatgg aatcatttat cttcaagcca ctccagagac atgcttacat 720 
agaatatatt tacggggaag aaatgaagag caaggcattc ctcttgaata tttagagaag 780 
cttcattata aacatgaaag ctggctcctg cataggacac tgaaaaccaa cttcgattat 840 
cttcaagagg tgcctatctt aacactggat gttaatgaag actttaaaga caaatatgaa 900 
agtctggttg aaaaggtcaa agagtttttg agtactttgt gatcttgctg aagactacag 960 
gcagccaaat ggttccagat acttcagctt tgtgtatctt cgtaacttca tattaatata 1020 
agtttcttta gaaaacccaa gtttttaatc gtttttgttt taaggaaaaa agatttttaa 1080 
aatgaatctt atgcaaaact ttttgatcag tttcttttct tttgtttttt ttttaaaaaa 1140 
gacatttaaa gacaaagaca ttatttctca tagcaggaaa tgtagaggta gatggttcca 1200 
gtatcagcat agtgactaaa ctacattata aaagatccag cttccttctg tcattcccct 1260 
cttttgtctt cctcagcagg ttggcttttt tccctggtgc ctctcacttc gttggtgacc 1320 
agtttcttaa actgaaagct ttaatgttac atagtaaatg gtagtgtgtc ctgtgtaaat 1380 
tagtgtacct attaaaagtt gcaaagtgga attaaaggaa tccctagaat aaggattctg 1440 
aagttttatt ttaaattatt atcttcttaa cagtttagtc ccacctctta cttcctgcct 1500 
cagtctgctt tctctactgt ctggattaat taggcagcct gctataaagt taaagtcaca 1560 
catttctatt ttgcaaacac tgtgattact ctttgctttg tagtttgctt tgctttgtag 1620 
ggttctgctt ttaagttttt ctctttttca gacaaattac tgataaaaat gatattgctc 1680 
tatatgtaat atatcctgaa agcattattt tttgttgaat aggaaataaa attaatgaag 1740 
acagaggcta gaaagcatcc attaattaat gagacacact taactactta tctctaaacc 1800 
atctatgtga atatttgtaa aaataatgaa tggactcatc ttagttctgt atataaatat 1860 
attttctttc tagtttgttt agttaaggtg tgcagtgttt ttcctgtgta ttaaaccttt 1920 
ccattttacg ttttagaaaa ttttatgtat tttaaaataa ggggaagagt cattttcacc 1980 
tttaaactac tatttttctt tccaagtcat ttttgttttt ggtttcttat tcaaagatga 2040 
taatttagtg gattaaccag tccagacgca ctgatctttg caaaggagac ttaatttcaa 2100 
atctgtaatt accatacata aactgtctca ttatacgtat gcattttttt agtttgtttt 2160 
tgtttggtat aaattaattt gttaattaaa tatttcttaa gtataaacct tatgaactac 2220 
agtggagcta cactcattga aatgtaattt cagttctaaa aagatgtaat aatcatttta 2280 
gaattaaaat ttattctact tttaaataaa ttatgaatat taaaggtgaa aattgtataa 2340 
attactttga ttccatttta agtggagaca tatttcagtg atttttagta acctttaaaa 2400 
atgtataatg acttttaaaa tttgtagaat tgaaaagacg ctaataaaaa tttattattt 24 60 



<210> 404 
<211> 558 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig51464_RC 
<400> 404 

agagtttcac aaataacatt tattgaccac 

tgaactttgc tacaaccaag aatcacctga 

tggattctgt tctgagaaga cagcgactgt 

aagatcttga tcgtgacaca tttttgtggc 

tcaaacccca' aaacactggc gaatggccat 

caggaggccg aggggcagca gacgcgctgc 

ctggagctgc tggaacttgg acaacaggca 

ccagcagcat gcatccagca ggggtgctgc 

taaggcagtg ctgcttccac cctggacaca 
tcacagcaga tggccagg 

<210> 405 
<211> 1929 



attcataaca gaaactcacc cacattcaca 60 
tgagaaatca tttgggttga atcctgggtg 120 
cacagacgcc tcctgattgg tggactctca 180 
tgttcttgac agtagctgaa gattccaact 240 
gatgaccctc gggatggcct ggaggagggt 300 
aggagccggt agctggtcct ggtagggggc 360 
ctcatcaccg cagatccgcc agcagttgtg 420 
tttcggggtc tcagggacag agggacccag 480 
ggcaggcggc ccccgaatct ggtctggcat 54 0 
558 
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<212> DNA 

<213> Homo sapiens 

<300> 

<308> AL080079 
<400> 405 

gggggagaat ttccattagg caagctgtat gcagtggaat tttcctttta ggagacacac 60 
aattaagact ctctggttct gtccttgctt tggagacttt aagacatttc ctaaagcaca 120 
aataaaagcc tcgtatttcc ccattgagag ttttgttcca aggaatatga agtgagacat 180 
atgggtgagt tataataatc aaaataattt atgaagagct gggtctgcaa tagctagtct 240 
aaaaactact tgtgtgtcag tcctctggtt atagtatata agagcctgag gaggtctggc 300 
aagatagatg gtgtattatt tatggatcag gctgctgcat acaaaccttg catactatta 360 
tgcagcttac ctaactctca gactattctg agtaatgctt gcttgctaat gaatgtatag 420 
gagaccacat tgtaattgtt cttagatgat ggagtccatg cagtttctta gaaatcggtc 480 
tcagtgcatg ctgtgctttt tcacatttgc tctgggttat ctgggaagta tcaggttctg 540 
ggaggcaaca gcattaagtg ataagaaaag gagacatfcct ggcaaagcca atctgcttaa 600 
aggcaaagtc cagaacctgg aacctagagg cctttctctc tgcacgaaaa acaggtagtt 660 
tgcagtctga gatatgggag agcttttagg ctacacagca acccaaggga cctctcacct 720 
tttgctgagc ttcaatcagg aagctatttg cctggctcca gcagatgatg agataatgag 780 
gtagtgggtt ttttattact gttccatttt gcaacatcct gcaacaccat cctgggagac 840 
aagagcatta ccgagcttgg ctttcacggg ggagggttgt attcagtaaa aaagaatagt 900 
aaatataagg tcactgagat tctaagtaag atagtaaatc taaggtcact gagccaaatc 960 
cttttcaata ggcatatatt aacaggctgc ttattttggc ggaggttaca tatggatgaa 1020 
aatgaatctt agtcactgaa tattcatata catttccccc aaaaccttag acattcatag 1080 
tagattttaa ttagttagct ttctaactag tcagatttct gcccaaagtg cttagtcaat 114 0 
agtaattaag atatagcttc aaaaatggct ttgcttttga tttctactca tattcgtatg 1200 
gctccagaaa aatatttttt tcatatttga caatgtcagc tccactttag aaattttcaa 1260 
taaccagatg agaaaaaaat taagaaattg ctcaagggaa acatttgtaa atggatttga 1320 
aagattgagc caaattctgt tgtcagttct aagcatgcag ttctcacctc catttagtcc 1380 
cccatcagaa cagaggtcag gaatttagct ggggagccta aatttagttc agcttacctt 1440 
tgagaatagc atcaattcag actctctttt cattatgttt tcttttcttt ttccctcttt 1500 
ttaaactaca ttgtgttaga gtcgtagtct aggatcctga gagattttcc attcttgtca 1560 
ccattcactt gcattgtaaa gattttcttt gtctgttgtt ggcatagatt cttttgtaca 1620 
tatttattta tttgtgttta tatatgtcaa ttggtttcct ttcttagctt gatattgcct 1680 
agctttgttg ttttaattaa ctttctatta gagggactgt atatattttt tctaaatact 1740 
ttgtgaaatc atttttggta gcaatatctt tgaatatgat gaataaaagt gactgtgagt 1800 
gcaaatagaa ttagcagtaa gaagctactc tagctaattt gccatttcac ttaaatggaa 1860 
aatgtttttc aaataaatac ttatgttgtt catgttccaa gttattcaaa aaaaaaaaaa 1920 
aaaaaaaac 1929 

<210> 406 
<211> 3915 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_006931 
<400> 406 

gtggggtggg gtggggctgg gggcttgtcg ccctttcagg ctccaccctt tgcggagatt 60 
ataaatagtc atgatcccag cgagacccag agatgcctgt aatggtgaga ctttggatcc 120 
ttcctgagga cgtggagaaa actttctgct gagaaggaca ttttgaaggt tttgttggct 180 
gaaaaagctg tttctggaat cacccctaga tctttcttga agacttgaat tagattacag 240 
cgatggggac acagaaggtc accccagctc tgatatttgc catcacagtt gctacaatcg 300 
gctctttcca atttggctac aacactgggg tcatcaatgc tcctgagaag atcataaagg 360 
aatttatcaa taaaactttg acggacaagg gaaatgcccc accctctgag gtgctgctca 420 
cgtctctctg gtccttgtct gtggccatat tttccgtcgg gggtatgatc ggctcctttt 480 
ccgtcggact cttcgtcaac cgctttggca ggcgcaattc aatgctgatt gtcaacctgt 540 
tggctgtcac tggtggctgc tttatgggac tgtgtaaagt agctaagtcg gttgaaatgc 600 
tgatcctggg tcgcttggtt attggcctct tctgcggact ctgcacaggt tttgtgccca 660 
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tgtacattgg agagatctcg cctactgccc tgcggggtgc ctttggcact ctcaaccagc 720 
tgggcatcgt tgttggaatt ctggtggccc agatctttgg tctggaattc atccttgggt 780 
ctgaagagct atggccgctg ctactgggtt ttaccatcct tcctgctatc ctacaaagtg 840 
cagcccttcc attttgccct gaaagtccca gatttttgct cattaacaga aaagaagagg 900 
agaatgctaa gcagatcctc cagcggttgt ggggcaccca ggatgtatcc caagacatcc 960 
aggagatgaa agatgagagt gcaaggatgt cacaagaaaa gcaagtcacc gtgctagagc 1020 
tctttagagt gtccagctac cgacagccca tcatcatttc cattgtgctc cagctctctc 1080 
agcagctctc tgggatcaat gctgtgttct attactcaac aggaatcttc aaggatgcag 1140 
gtgttcaaga gcccatctat gccaccatcg gcgcgggtgt ggttaatact atcttcactg 1200 
tagtttctct atttctggtg gaaagggcag gaagaaggac tctgcatatg ataggccttg 1260 
gagggatggc tttttgttcc acgctcatga ctgtttcttt gttattaaag gataactata 1320 
atgggatgag ctttgtctgt attggggcta tcttggtctt tgtagccttc tttgaaattg 1380 
gaccaggccc cattccctgg tttattgtgg ccgaactctt cagccagggc ccccgcccag 1440 
ctgcgatggc agtggccggc tgctccaact ggacctccaa cttcctagtc ggattgctct 1500 
tcccctccgc tgctcactat ttaggagcct acgtttttat tatcttcacc ggcttcctca 1560 
ttaccttctt ggcttttacc ttcttcaaag tccctgagac ccgtggoagg acttttgagg 1620 
atatcacacg ggcctttgaa gggcaggcac acggtgcaga tagatctgga aaggacggcg 1680 
tcatggagat gaacagcatc gagcctgcta aggagaccac caccaatgtc taagtcgtgc 1740 
ctccttccac ctccctcccg gcatgggaaa gccacctctc cctcaacaag ggagagacct 1800 
catcaggatg aacccaggac gcttctgaat gctgctactt aattcctttc tcatcccacg 1860 
cactccatga gcaccccaag gctgcggttt gttggatctt caatggcttt ttaaatttta 1920 
tttcctggac atcctcttct gcttaggaga gaccgagtga acctaccttc atttcaggag 1980 
ggattggccg cttggcacat gacaactttg ccagcttttc ctcccttggg ttctgatatt 204 0 
gccgcactag gggatatagg agaggaaaag taaggtgcag ttcccccaac ctcagactta 2100 
ccaggaagca gatacatatg agtgtggaag ccggagggtg tttatgtaag agcaccttcc 2160 
tcacttccat acagctctac gtggcaaatt aacttgagtt ttatttattt tatcctctgg 2220 
tttaattaca taattttttt ttttttactt taagtttcag gatacatgtg ccgaatgtgc 2280 
aggtttgtta cataggtata tatatgccat gatggaaata tttatttttt taagcgtaat 2340 
tttgccaaat aataaaaaca gaaggaaatt gagattagag ggaggtgttt aaagagaggt 24 00 
tatagagtag aagatttgat gctggagagg ttaaggtgca ataagaattt agggagaaat 2460 
gttgttcatt attggagggt aaatgatgtg gtgcctgagg tctgtacgtt acctcttaac 2520 
aatttctgtc cttcagatgg aaactcttta acttctcgta aaagtcatat acctatataa 2580 
taaagctact gatttccttg gagctttttt ctttaagata atagtttaca tgtagtagta 2640 
cttgaaatct aggattatta actaatatgg gcattgtagt taatgatggt tgatgggttc 2700 
taattttgga tggagtccag ggaagagaaa gtgatttcta gaaagcctgt tcccctcact 27 60 
ggatgaaata actccttctt gtagtagtct cattactttt gaagtaatcc cgccacctat 2820 
ctcgtgggag agccatccaa ataagaaacc taaaataatt ggttcttggt agagattcat 2880 
tatttttcca ctttgttctt taggagattt taggtgttga ttttctgttg tattttaact 2940 
cataccttta aaggaattcc ccaaagaatg tttatagcaa acttggaatt tgtaacctca 3000 
gctctgggag aggatttttt tctgagcgat tattatctaa agtgtgttgt tgctttaggc 3060 
tcacggcacg cttgcgtatg tctgttacca tgtcactgtg gtcctatgcc gaatgccctc 3120 
aggggacttg aatctttcca ataaaccagg tttagacagt atgagtcaat gtgcagtgta 3180 
gcccacactt gagaggatga atgtatgtgc actgtcactt tgctctgggt ggaagtacgt 3240 
tattgttgac ttattttctc tgtgtttgtt cctacagccc ctttttcata tgttgctcag 3300 
tctccctttc ccttcttggt gcttacacat ctcagaccct ttagccaaac ccttgtcagt 3360 
gacagtattt tggttcttag ttctcactgt tccctctgct cctggagcct ttgaataaaa 3420 
atgcacgtag ctgaggccgg atgcggtggc tcacgcctgt aatcccagca ctttgggagg 3480 
cctaggcggg cggtcagggg ttcgagacca gtctggccaa catcgtgaaa ccctgtctct 3540 
actaaaaatg caaaaattag ccgggcgtgg tggcgggcgc ctgtaatccc agctacttgg 3600 
gaagctgagg cgggagaatc atgtgaaccc gggacgcagg ggttgcagtg agcggagatc 3660 
gcatcattgc actctagcct gggccacagg gcgagactcc gtctcaaaaa aaaaaaaatg 3720 
cacatagcta tcgagtgtgc tttagcttga aaaggtgacc ttgcaacttc atgtcaactt 3780 
tctggctcct caaacagtag gttggcagta aggcagggtc ccatttctca ctgagaagat 3840 
tgtgaatatt tccatatgga ttttctattg ttactctggt tctttgtttt aaaataaaaa 3900 
ttctgaatgt. acacg 3915 

<210> 407 

<211> 2125 

<212> DNA 

<213> Homo sapiens 
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<300> 

<308> AF257157 
<400> 407 

tggcgatggc gtacttggct tggagactgg cgcggcgttc gtgtccgagg tcactagttt 60 
cccggtagtt cagctgcaca tgaatagaac agcaatgaga gccagtcaga aggactttga 120 
aaattcaatg aatcaagtga aactcttgaa aaaggatcca ggaaacgaag tgaagctaaa 18 0 
actctacgcg ctatataagc aggccactga aggaccttgt aacatgccca aaccaggtgt 24 0 
atttgacttg atcaacaagg ccaaatggga cgcatggaat gcccttggca gcctgcccaa 300 
ggaagctgcc aggcagaact atgtggattt ggtgtccagt ttgagtcctt cattggaatc 360 
ctctagtcag gtggagcctg gaacagacag gaaatcaact gggtttgaaa ctctggtggt 420 
gacctccgaa gatggcatca caaagatcat gttcaaccgg cccaaaaaga aaaatgccat 480 
acacactgag atgtatcatg aaattatgcg tgcacttaaa gctgccagca aggatgactc 54 0 
aatcatcact gttttaacag gaaatggtga ctattacagt agtgggaatg atctgactaa 600 
cttcactgat attccccctg gtggagtaga ggagaaagct aaaaataatg ccgttttact 660 
gaggtaagag ttactgacct ttgtctgatc ccagagggcc cgttcttagt catcaaagtg 720 
gtatttagtt gagcaaaagt gcaatatctg cttctctgtg tcagtgcagt taattgtgac 780 
tttggacaag catatttctc taaactggct ccatccatag gtgggcattc attctttctc 840 
tttgacctct gttacctgtt catcctgctg gtctcatgag atttgactgg aaaacataaa 900 
agtgagctat agttgtaaac aaaattatct tgcattgagg cagaggaatt tagcaaattc 960 
tcatccagcc agaggaatgg gagatttcca tgaacgctga aggtaggaga ctcggaggag 1020 

tcttgtaaac acaggaggag gtcatgaagg cttttctctt tattcctgcc aaatctgtaa 1080 

ttcccacttg ctgtgatatt gcagcatgtg ctcagtcacc caacctaatc tctaagactg 1140 

cggctggttt acagccctga actcactgtc cagaatacaa gattcaaacc agggttgacc 1200 

cataggtctc actggttggt cttccatcca ctgccattca gacggatgcc ctggatgaat 1260 

ctggtgcgga agaccaggga agtggcagga caaggaggca ctgctggtgg tggctgtgga 1320 

aacacccagc ctgacagaag gctttatctc agagtcagac tgtcttgccc atgcttcagt 1380 

catgctgctt tggcacattt aatgattttg tgtttccatt tctgtaggga atttgtgggc 14 4 0 

tgttttatag attttcctaa gcctctgatt gcagtggtca atggtccagc tgtgggcatc 1500 

tccgtcaccc tccttgggct attcgatgcc gtgtatgcat ctgacagggc aacatttcat 1560 

acaccattta gtcacctagg ccaaagtccg gaaggatgct cctcttacac ttttccgaag 1620 

ataatgagcc cagccaaggc aacagagatg cttatttttg gaaagaagtt aacagcggga 1680 

gaggcatgtg ctcaaggact tgttactgaa gttttccctg atagcacttt tcagaaagaa 1740 

gtctggacca ggctgaaggc atttgcaaag cttcccccaa atgtcttgag aatttcaaaa 1800 

gaggtaatca ggaaaagaga gagagaaaaa ctacacgctg ttaatgctga agaatgcaat 1860 

gtccttcagg gaagatggct atcagatgaa tgcacaaatg ctgtggtgaa cttcttatcc 1920 

agaaaatcaa aactgtgatg accactacag cagagtaaag catgtccaag gaaggatgtg 1980 

ctgttacctc tgatttccag tactggaact aaataagctt cattgtgcct tttgtagtgc 2040 

tagaatatca attacaatga tgatatttca ctacagctct gatgaataaa aagttttgta 2100 

aaccaaaaaa aaaaaaaaaa aaaaa 2125 

<210> 408 

<211> 1446 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002916 
<400> 408 

ctgccattta ggacaagctg gatgatgatg gtttgatagc tccaggggtt cgtgtatagg 60 

agatgatgaa tctgcttcat ccagaatcac aatcttaaaa ggcgggaact gaggcgactg 120 

tggggacatc agtgatcgta agtctcctgg gcccgttatt ctcagattag gtgacggagc 180 

taagacttcg agaccatctc gtcctttttg tatcgcggaa acctgaggaa cgagccggcg 240 

gcggtgacct gcacgagaag ccaggctaac tgggtgaagt accatgcaag catttcttaa 300 

aggtacatcc atcagtacta aacccccgct gaccaaggat cgaggagtag ctgccagtgc 360 

gggaagtagc ggagagaaca agaaagccaa acccgttccc tgggtggaaa aatatcgccc 420 

aaaatgtgtg gatgaagttg ctttccagga agaagtggtt gcagtgctga aaaaatcttt 480 

agaaggagca gatcttccta atctcttgtt ttacggacca cctggaactg gaaaaacatc 540 

cactattttg gcagcagcta gagaactctt tgggcctgaa cttttccgat taagagttct 600 
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tgagttaaat gcatctgatg aacgtggaat 
tgctcaatta actgtgtcag gaagtcgctc 
tgtgattctg gatgaagcag attctatgac 
catggagaag gagtcgaaaa ccacccgatt 
aattgaaccc ctgacctcta gatgttcaaa 
tcaacagcag cgattactag acattgccaa 
aatagcttat cttgttaaag tgtcagaagg 
aagcgctact cgattaacag gtggaaagga 
tggggtaata ccagctgaga aaattgatgg 
tgacaaacta gaagctgtgg tcaaggattt 
cgtcaatcaa ctccatgatg tggttgtaga 
tatcacagaa aaacttgccg aagttgacaa 
gcaactcatc agcctttgtg caactgtgat 
tatatctgga tggggggttt tgtaaataat 
gcaccg 

<210> 409 

<211> 788 

<212> DNA 

<213> Homo sapiens 



acaagtagtt cgagagaaag tgaaaaattt 660 
agatgggaag ccgtgtccgc cttttaagat 720 
ctcagctgct caggcagctt taagacgtac 780 
ctgtcttatc tgtaactatg tcagtcgaat 840 
attccgcttc aagcctctgt cagataaaat 900 
gaaggaaaat gtcaaaatta gtgatgaggg 960 
agacttaaga aaagccatta catttcttca 1020 
gatcacagag aaagtgatta cagacattgc 1080 
agtatttgct gcctgtcaga gtggctcttt 114 0 
aatagatgag ggtcatgcag caactcagct 1200 
aaataactta tctgataaac agaagtctat 1260 
atgcctagca gatggtgctg atgaacattt 1320 
gcagcagtta tctcagaatt gttaacgtga 1380 
gaagttgtaa taaaaataaa atgaccaaaa 1440 
1446 



<300> 

<308> Contig55725_RC 



<400> 409 

caagttagaa caaacatttt attattaaaa 
aaagctgtat ttacacttat cgattcaagg 
agcaatggtt tacatatgca agtaaattgg 
aaataccttg attaatgcaa agaggagggg 
ctagactggc agagatgtca ggttaacaaa 
ttatcattgt tataaaagtg cttaaaatct 
ttacaagaaa ctgcaaagaa ccacagacta 
gtccaagaat ttatttgcaa taccttacat 
ttatactaaa gttttatact gaaaattcaa 
ttccatcatt attttaatta agtattacat 
aagctggacg caccacttac ccttcgggca 
gtgacaacaa cttgtaacat aagcgcaata 
attacctaat ttttccacat ggttttaaga 
aactagta 



taaacttttg tataaaagca ttacagatca 60 
tccaattatg catcaaacat tgaatggcac 120 
acatacaaac acttagattc cacctctacc 180 
gaatactgac acaggaaacc tgcccagaaa 24 0 
ctgctaaaag ttacatctcc aaaaaggcac 300 
aaacttgaac cttgtgcctg gtttataaat 360 
gtttttaata tcaagtttcc atacaaaatt 420 
gtgaactgaa ataaactttg caaactacaa 4 80 
atcagcttac tcataatctc cagatagctc 54 0 
tgcccaccaa cctttgaatt atgatcgtgt 600 
gaagaaatta ggagaaatta agtctcaact 660 
catgggattg agagacatca atctgcaata 720 
cttttgtgcc ttgattctat taaaaatttc 780 
788 



<210> 410 
<211> 433 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig24252_RC 



<400> 410 

cacgtaacac aaaagctttt acttggaaac 
ctcacgctct ggcgggagct cggatgcaga 
tgctcctctt tggcatgtat cagcagagcc 
gttctccatc aaggtatggg gacccctact 
gaagaagccc aggcagttga cccagtcttt 
cactggcaga tgagaggttt gcacctatac 
tgtcagtgtc caggtacctg ggctgggcca 
gggtaagggc tgc 



tggcaaatac tggactagaa tactgacatg 60 
agctattgca caaagcccct ctgattgcct 120 
ccaagggcca attgcccaca ggtggggact 180 
tccttgtttt gttaaaaagt gcaggtaggc 240 
gaaacagctg actccccagc acccagtgtc 300 
cagcccttgt ttccttggta gggctcaagt 360 
gagctggggc caggggagcc tggtcttgag 420 
433 



<210> 411 
<211> 969 
<212> DNA 
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<213> Homo sapiens 
<300> 

<308> AF201951 
<400> 411 

acggaacagc cagtgggagg ttccagctga gcgctcccca gaggtgagct gatccccagc 60 
cacagcacac aggaccaggc tgcgagaaca gcatcatcag catcatgcta ttacaatccc 120 
aaaccatggg ggtttctcac agctttacac caaagggcat cactatccct caaagagaga 180 
aacctggaca catgtaccaa aacgaagatt acctgcagaa cgggctgcca acagaaacca 240 
ccgttcttgg gactgtccag atcctgtgtt gcctgttgat ttcaagtctg ggggccatct 300 
tggtttttgc tccctacccc tcccacttca atccagcaat ttccaccact ttgatgtctg 360 
ggtacccatt tttaggagct ctgtgttttg gcattactgg atccctctca attatctctg 420 
gaaaacaatc aactaagccc tttgacctga gcagcttgac ctcaaatgca gtgagttctg 4 80 
ttactgcagg agcaggcctc ttcctccttg ctgacagcat ggtagccctg aggactgcct 540 
ctcaacattg tggctcagaa atggattatc tatcctcatt gccttattcg gagtactatt 600 
atccaatata tgaaatcaaa gattgtctcc tgaccagtgt cagtttaaca ggtgtcctag 660 
tggtgatgct catcttcact gtgctggagc tcttattagc tgcatacagt tctgtctttt 720 
ggtggaaaca gctctactcc aacaaccctg ggagttcatt ttcctcgacc cagtcacaag 780 
atcatatcca acaggtcaaa aagagttctt cacggtcttg gatataagta actcttggcc 840 
tcagaggaag gaaaagcaac tcaacactca tggtcaagtg tgattagact ttcctgaaat 900 
ctctgccatt ttagatactg tgaaacaaac taaaaaaaaa aaagcttttg ttttgtaaaa 960 
aaaaaaaaa 969 

<210> 412 
<211> 2917 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_005915 
<400> 412 

ggaaaaaagc gacttgtggc ggtcgagcgt ggcgcaggcg aatcctcggc actaagcaaa 60 
tatggacctc gcggcggcag cggagccggg cgccggcagc cagcacctgg aggtccgcga 120 
cgaggtggcc gagaagtgcc agaaactgtt cctggacttc ttggaggagt ttcagagcag 180 
cgatggagaa attaaatact tgcaattagc agaggaactg attcgtcctg agagaaacac 240 
attggttgtg agttttgtgg acctggaaca atttaaccag caactttcca ccaccattca 300 
agaggagttc tatagagttt acccttacct gtgtcgggcc ttgaaaacat tcgtcaaaga 360 
ccgtaaagag atccctcttg ccaaggattt ttatgttgca ttccaagacc tgcctaccag 420 
acacaagatt cgagagctca cctcatccag aattggtttg ctcactcgca tcagtgggca 480 
ggtggtgcgg actcacccag ttcacccaga gcttgtgagc ggaacttttc tgtgcttgga 540 
ctgtcagaca gtgatcaggg atgtagaaca gcagttcaaa tacacacagc caaacatctg 600 
ccgaaatcca gtttgtgcca acaggaggag attcttactg gatacaaata aatcaagatt 660 
tgttgatttt caaaaggttc gtattcaaga gacccaagct gagcttcctc gagggagtat 720 
cccccgcagt ttagaagtaa ttttaagggc tgaagctgtg gaatcagctc aagctggtga 780 
caagtgtgac tttacaggga cactgattgt tgtgcctgac gtctccaagc ttagcacacc 840 
aggagcacgt gcagaaacta attcccgtgt cagtggtgtt gatggatatg agacagaagg 900 
cattcgagga ctccgggccc ttggtgttag ggacctttct tataggctgg tctttcttgc 960 
ctgctgtgtt gcgccaacca acccaaggtt tggggggaaa gagctcagag atgaggaaca 1020 
gacagctgag agcattaaga accaaatgac tgtgaaagaa tgggagaaag tgtttgagat 1080 
gagtcaagat aaaaatctat accacaatct ttgtaccagc ctgttcccta ctatacatgg 1140 
caatgatgaa gtaaaacggg gtgtcctgct gatgctcttt ggtggcgttc caaagacaac 1200 
aggagaaggg acctctcttc gaggggacat aaatgtttgc attgttggtg acccaagtac 1260 
agctaagagc caatttctca agcacgtgga ggagttcagc cccagagctg tctacaccag 1320 
tggtaaagcg tccagtgctg ctggcttaac agcagctgtt gtgagagatg aagaatctca 1380 
tgagtttgtc attgaggctg gagctttgat gttggctgat aatggtgtgt gttgtattga 1440 
tgaatttgat aagatggacg tgcgggatca agttgctatt catgaagcta tggaacagca 1500 
gaccatatcc atcactaaag caggagtgaa ggctactctg aacgcccgga cgtccatttt 1560 
ggcagcagca aacccaatca gtggacacta tgacagatca aaatcattga aacagaatat 1620 
aaatttgtca gctcccatca tgtcccgatt cgatctcttc tttatccttg tggatgaatg 1680 
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taatgaggtt acagattatg ccattgccag gcgcatagta gatttgcatt caagaattga 1740 

ggaatcaatt gatcgtgtct attccctcga tgatatcaga agatatcttc tctttgcaag 1800 

acagtttaaa cccaagattt ccaaagagtc agaggacttc attgtggagc aatataaaca 1860 

tctccgccag agagatggtt ctggagtgac caagtcttca tggaggatta cagtgcgaca 1920 

gcttgagagc atgattcgtc tctctgaagc tatggctcgg atgcactgct gtgatgaggt 1980 

ccaacctaaa catgtgaagg aagctttccg gttactgaat aaatcaatca tccgtgtgga 2040 

aacacctgat gtcaatctag atcaagagga agagatccag atggaggtag atgagggtgc 2100 

tggtggcatc aatggtcatg ctgacagccc tgctcctgtg aacgggatca atggctacaa 2160 

tgaagacata aatcaagagt ctgctcccaa agcctcctta aggctgggct tctctgagta 2220 

ctgccgaatc tctaacctta ttgtgcttca cctcagaaag gtggaagaag aagaggacga 2280 

gtcagcatta aagaggagcg agcttgttaa ctggtacttg aaggaaatcg aatcagagat 234 0 

agactctgaa gaagaactta taaataaaaa aagaatcata gagaaagtta ttcatcgact 2400 

cacacactat gatcatgttc taattgagct cacccaggct ggattgaaag gctccacaga 24 60 

gggaagtgag agctatgaag aagatcccta cttggtagtt aaccctaact acttgctcga 2520 

agattgagat agtgaaagta actgaccaga gctgaggaac tgtggcacag cacctcgtgg 2580 

cctggagcct ggctggagct ctgctaggga cagaagtgtt tctggaagtg atgcttccag 2640 

gatttgtttt cagaaacaag aattgagttg atggtcctat gtgtcacatt catcacaggt 2700 

ttcataccaa cacaggcttc agcacttcct ttggtgtgtt tcctgtccca gtgaagttgg 2760 

aaccaaataa tgtgtagtct ctataaccaa tacctttgtt ttcatgtgta agaaaaggcc 2820 

cattactttt aaggtatgtg ctgtcctatt gagcaaataa ctttttttca attgccagct 2880 

actgctttta ttcatcaaaa taaaataact tgttctg 2917 

<210> 413 
<211> 5701 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_001282 
<400> 413 

ctgcccacca tctttgtccc tggcaaagtg ggttttgcgc agtggcttag acctagaaaa 60 
gaatcgtgac gggcaggaaa ccattacacc accacctggg ctgtgctctc cggctcccgc 120 
cgccaccccc gccctcgcct tcgcctccgc tccggtgcac attaaagatc caaagtcatg 180 
actgactcca agtatttcac aaccaataaa aaaggagaaa tatttgaact aaaagctgaa 240 
ctcaacaatg aaaagaaaga aaagagaaag gaggctgtga agaaagtgat tgctgotatg 300 
accgtgggga aggatgttag ttctctcttt ccagacgtag tgaactgtat gcagactgac 360 
aatctggaac taaagaagct tgtgtatctc tacttgatga actacgccaa gagtcagcca 420 
gacatggcca tcatggctgt aaacagcttt gtgaaggact gtgaagatcc taatcctttg 480 
attcgagcct tggcagtcag aaccatgggg tgcatccggg tagacaaaat tacagaatat 540 
ctctgtgagc cgctccgcaa gtgcttgaag gatgaggatc cctatgttcg gaaaacagca 600 
gcagtctgcg tggcaaaact ccatgatatc aatgcccaaa tggtggaaga tcagggattt 660 
ctggattctc tacgggatct catagcagat tcaaatccaa tggtggtggc taatgccgta 720 
gcggcattat ctgaaatcag tgagtctcac ccaaacagca acttacttga tctgaaccca 780 
cagaacatta ataagctgct gacagccctg aatgaatgca ctgaatgggg ccagattttc 840 
atcctggact gcctgtctaa ttacaaccct aaagatgatc gggaggctca gagcatctgt 900 
gagcgggtaa ctccccggct atcccatgcc aactcagcag tggtgctttc agcggtaaaa 960 
gtcctaatga agtttctaga attgttacct aaggattctg actactacaa tatgctgctg 1020 
aagaagttag cccctccact tgtcactttg ctgtctgggg agccagaagt gcagtatgtc 1080 
gccctgagga acatcaactt aattgtccag aaaaggcctg aaatcttgaa gcaggaaatc 1140 
aaagtcttct ttgtgaagta caatgatccc atctatgtta aactagagaa gttggacatc 1200 
atgattcgtt tggcatctca agccaacatt gctcaggttc tggcagaact gaaagaatat 12 60 
gctacagagg tggatgttga ctttgttcga aaagctgtgc gggccattgg acggtgtgcc 1320 
atcaaggtgg agcaatctgc agagcgctgt gtaagcacat tgcttgatct aatccagacc 1380 
aaagtgaatt atgtggtcca agaagcaatt gttgtcatca gggacatctt ccgcaaatac 1440 
cccaacaagt atgaaagtat catcgccact ctgtgtgaga acttagactc gctggatgag 1500 
ccagatgctc gagcagctat gatttggatt gtgggagaat atgctgaaag aattgacaat 1560 
gcagatgagt tactagaaag cttcctggag ggttttcacg atgaaagcac ccaggtgcag 1620 
ctcactctgc ttactgccat agtgaagctg tttctcaaga aaccatcaga aacacaggag 1680 
ctagtccagc aggtcttgag tttggcaaca caggattctg ataatcctga ccttcgagac 1740 
cggggctata tttattggcg ccttctctca actgaccctg ttacagctaa agaagtagtc 1800 
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ttgtctgaga agccactgat ctctgaggag acggacctta ttgagccaac tctgctggat 1860 

gagctaatct gccacattgg ttctttggcc tctgtgtatc ataagcctcc caatgctttt 1920 

gtggaaggaa gtcatggaat tcatcgtaaa cacttgccaa ttcatcatgg gagcactgat 1980 

gcaggtgaca gccctgttgg cactaccact gcaacgaacc tggaacagcc tcaggttatc 2040 

ccctctcaag gtgatcttct aggggatctt ttaaaccttg acctcggtcc cccagtcaat 2100 

gtgccacagg tgtcctccat gcagatggga gcagtggatc tcctaggagg aggactagat 2160 

agtctggtgg gacaatcctt catcccatca tcggtgcctg caacctttgc tccttcacct 2220 

acacctgctg tggtcagcag tggactgaat gacctgtttg aactctccac agggataggc 2280 

atggcacctg gtggatatgt ggctcctaag gctgtctggc tacctgcagt aaaggctaaa 2340 

ggcttggaga tttccggaac atttactcac cgccaagggc acatctatat ggaaatgaac 24 00 

ttcaccaata aagctctgca gcacatgaca gattttgcaa tccagtttaa caaaaatagc 24 60 

tttggtgtca tccccagcac tcctctggcc atccatacac cactgatgcc aaaccagagc 2520 

attgatgtct ccctgcctct caataccttg ggcccagtca tgaagatgga acctctgaat 2580 

aacctccagg tggctgtgaa aaacaatatc gatgtcttct acttcagctg cctcatccca 2640 

ctcaatgtgc tttttgtaga agatggcaaa atggagcgcc aggtcttcct tgcaacatgg 2700 

aaggatattc ccaatgaaaa tgaacttcag tttcagatta aggaatgtca tttaaatgct 27 60 

gacactgttt ccagcaagtt gcaaaacaac aatgtttata ctattgccaa gaggaatgtg 2820 

gaagggcagg acatgctgta ccaatccctg aagctcacta atggcatttg gattttggcc 2880 

gaactacgta tccagccagg aaaccccaat tacacgctgt cactgaagtg tagagctcct 2940 

gaagtctctc aatacatcta tcaggtctac gacagcattt tgaaaaacta acaagactgg 3000 

tccagtaccc ttcaaccatg ctgtgatcgg tgcaagtcaa gaactcttaa ctggaagaaa 3060 

ttgtattgct gcgtagaatc tgaacacaot gaggccacct agcaaggtag taactagtct 3120 

aacctgtgct aacattaggg cacaacctgt tggatagttt tagcttcctg tgaacatttg 3180 

taaccactgc ttcagtcacc tcccacctct tgccacctgc tgctgctatc tgtccttact 3240 

tgtgggcttc tccatgctgt gccaatggct ggctttttct acaccctctt ttgagtgtag 3300 

tttggtattt tgtaattgag agctcatttc aaaagcagaa aaagacaaca aatattaaag 3360 

caaggaaaag tgtaactgaa acactgcact ttactgtttt atacttttgt acatatgaga 3420 

aatcaaggga ttagtgcaac cagtagaagg cattgaaatg actgtcatta accacacagt 3480 

cctggaggca gagatgcagt tacctaccct agcttttgat gggttctctt acctgtagta 3540 

gccttatccc tggtcatttg gattttcagt ttgctttttt ctttttttcc cctccaaact 3600 

ccttttcctt ggccaagcct tcatgcttcc ccctttccat attataatct catttgattg 3660 

ctctgcagtt gggaacggtg atcttcttga atgatgtttc agtgtgcaaa aactatagag 3720 

cctgtcagca ccaaagctga cagaagttat. accttactcc tttcctttcc cctgaacaaa 3780 

cctgctaatc ccactaattc aggaatttga gtagagatgg ggaacaagaa cccagatgct 3840 

gtcccctcac cccctctcct gtatttctca ggtccagttc aaatctaaaa ttctactttt 3900 

agagttgaaa cagagtaata acttatctaa ccctcttttc ctacaaagga gaaagataaa 3960 

aggcacaaag gttaccgcca aggcccgtca gctgtgtagt ggcaaagccg agaccgagtc 4020 

tcctaagtcc ccgtcagtgt ggttttcacc acaggactgt ctcttgtcgt tttcccctaa 4080 

tgccttctcc tgccttttct gtgcctagtt tttggctctt cacatattcc atattgattt 4140 

tgacgctctg tatattggca tcaggtggca gctgaatatc ttttgaatta ctcgaaggta 4200 

aagccagatg ccagaatgaa ggtgtagcca gtgtttccca tatgcccctg gagccccact 4260 

tattgaggcc agcagaatag gtgcagagat gaagtgagct tagagatgtt gcaaatgctc 4320 

tttatccctt cagctctctg atctgctctt tcttcatgat acttagtctg cagggcatat 4380 

taagatcatc ccagaggttc aggcagttcc tgtcatctct gaaaagactg ggggatatga 4440 

aatcttcccc ctaccccact taatgcgttg gatatgattt ttcaaagaat gcttcatgcc 4500 

caaaatacca gcctgtttag cagtgttaca ctgtttgatc tgcgggcact tgttgcattg 4560 

cctggcaccc aatattcagg gtccatgact aagactggtc ttctcagatg ccctgcttaa 4 620 

atcaggggca cttcaggctc cacaggcgtc atgttggact gagacctaac tcactggact 4 680 

cagaggagga atcgtggaaa acaagagcaa aactacccca cacccctatt tcatgtctga 4740 

aataaccctg tttcatacca gttgcaaagc ttgtggggag cggtcccaca aagcactttc 4800 

ttaaaccttg agaatctcca agagaaaaat atttggggaa ggagggagga aatatgtccc 4860 

ttgcacacca cccctgaagc acatggcagt aggaaacagc ataggattgt atgtgggagg 4 920 

tggataggtc ggtgatgtgt ggagcggaaa agcaggttgg taaagttccc ttcttgggac 4 980 

ttattcctgg agtcagtgga tacaagtagt gcagaaggtt cacactgcaa atagtgttct 504 0 

catctcaaag caaactatca ttccagaagg aaaagtgtgt cagggcaagc agacaacaca 5100 

atttcctatc agaatatgtc cctcaacccc cgaaacaagg cttctctcag cctccccacc 5160 

agtgatggat aacagctcct attctcagct gacctgactg agccaaccca tgaactcttc 5220 

actccttggg gaagccacct cccatcacac ccctgagcag agttagggag gaattctact 5280 

tcccataaaa ggacctctcc tgagaggcaa aacctgttgc ctccaccacg gcttccctct 534 0 

tggctcattc caagcttggc caaattgggg aagtgggatg gaggttgccc tgcatccccc 5400 

ctcctctgcc tgagtgtgtc tttgtaatgt cagctggcat catacaaaga gcaggagaag 5460 
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caaacaccca gaactctttt gctggtcaga gattccctga gtgtctgtcc tcacccaagc 5520 

ctgctctgtg tctgtgttgt gaagcttgag actctggaaa gaaatgggga gggggggcag 5580 

gggaaatgtt gccctaagaa tgcttctcat tcctctgttc ttattgggtc ctgtttttcg 5640 

ggagggtggg ggttggggga agcttgacct tgtgtcttcg tcaataaact cacatttaca 5700 



<210> 414 
<211> 1165 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig56457_RC 
<400> 414 

gcaagtcttt tttttttttt taataaaacc aacataacag aatgtagaaa tctaaacgaa 60 
catttctccc tcaaagtagt tacatcctta gctccaacaa aactactatt gttggagact 120 
tatttagaac tcctgtttga ggaaaggcct taaaaggtgg tttatgagcc acattagaaa 180 
atacccttct aaactttggg ctgtttaaaa acagaaaatc catactcaga ggatgacacc 240 
gagaaaatta aaaaggtgct gcagctctca cggaggttcc ctaaagacct ctagcaagtt 300 
ctgagcaaca gcatcaccat tggagaaacg tgtagcctcc caggacagcc actttccatg 360 
ggacggcatt catttggata gaaaagttct gctgtctgtt taaaaagaaa aaagaaaaaa 4 20 
aaaaaatcag ccacactcct ttatagtcac acttgcattt gattaaaaaa aaaaacaaca 480 
acaacaacac tcacacgtgg ttcttgtatg aattctaatt aagagacatt atattcagaa 540 
ttatagcact catagtgata attttcagaa gactataaca gcttctttgc cacttaaggg 600 
gtacagaggg tgcagctgtt tacctgaatg aattcttaaa taagtgacta ctaccactct 660 
aattttccta gttgtagctc tgttacaggt tactgaccgc caagtccatg ttccagacat 720 
tctttgccaa cacagattaa ttacaagaga gccaccccca aagtcttagg gcgaattggg 780 
aaagggccgg ttaaatgagg aaacaggcca ttccttggta aaagtttgta gttttctttc 840 
ccaatacatc ttctttttat tgagaacttc catttttatc ctgtgttcct cctctgccat 900 
ctcacactct cgctctatct gctggatttt ggccacatgc acctcattca tttgtatcag 960 
ttgcagttgt aagatggaca tttgttgatg gtgttcttcc tcatgcatct tttttaaagc 1020 
accttcctga gaggttttgc tcctgtagtt tttattggct gttattctga cagctgaaca 1080 
cgagggttca ggttgagagg tcccctcaca tactggaaaa tgtatgaact ctttctcatc 1140 
atcgcacagt tctctatttg aaaca 1165 

<210> 415 

<211> 867 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_020386 
<400> 415 

cgtttcagcg tggcggcgct ggtgctggcg ttggccctgg aggacggccc cgagtgatgg 60 
ctggcgcctg cctcccgggt gtctcccggg tacagatgga gtcgtcccgc ggccgccggc 120 
ggcaaggtcg gcagctgcga ggccaagaga gaccccagga cacacacagc tgcctcccgg 180 
tgcgagaaga agaccccggc ttgagagtga gatggcgttt aatgattgct tcagtttgaa 240 
ctaccctggc aacccctgcc caggggactt gatcgaagtg ttccgtcctg gctatcagca 300 
ctgggccctg tacttgggtg atggttacgt tatcaacata gcacctgtag atggcattcc 360 
tgcgtccttt acaagcgcca agtctgtatt cagcagtaag gccctggtga aaatgcagct 420 
cttgaaggat gttgtgggaa atgacacata cagaataaac aataaatacg atgaaacgta 480 
cccccctctc cctgtggaag aaatcataaa gcggtcagag tttgtaattg gacaggaggt 540 
ggcctataac ttacttgtca acaactgtga acattttgtg acattgcttc gctatggaga 600 
aggagtttca gagcaggcca accgagcgat aagtaccgtt gagtttgtga cagctgctgt 660 
tggtgtcttc tcattcctgg gcttgtttcc aaaaggacaa agagcaaaat actattaaca 720 
atttaccaaa gagatattga tattgaagga atttgggagg aggaaaagaa acctggggtg 780 
aatacttatt ttcagtgcat cattactgtt ccagattcct atgatggatg gcagactctt 840 
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taataaattg cttactgata ttatctt 867 

<210> 416 
<211> 3432 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 2741 

<223> n = A,T,C or G 
<300> 

<308> NM_014889 
<400> 416 

ctcgtcatgc gcaatgtggc gctgcggcgg gcggcagggc ctgtgtgtgc tgaggcggct 60 
gagcggcgga catgcacacc acagagcgtg gcgatggaac agtaaccggg cttgtgagag 120 
ggctctgcag tataaactag gagacaagat ccatggattc accgtaaacc aggtgacatc 180 
tgttcccgag ctgttcctga ctgcagtgaa gctcacccat gatgacacag gagccaggta 240 
tttacacctg gccagagaag acacgaataa tctgttcagc gtgcagttcc gtaccactcc 300 
catggacagt actggtgttc ctcacattct tgagcatacc gtcctttgtg ggtctcaaaa 360 
atatccgtgc agaaaccctt tcttcaaaat gttgaaccgg tccctctcca cgttcatgaa 420 
cgccttcaca gctagtgatt atactctgta tccattttcc acacaaaatc ccaaggactt 480 
tcagaatctc ctctcggtgt atttggatgc caccttttcc ccatgtttac gcgagctgga 540 
tttctggcag gaaggatggc ggctggaaca tgagaatccg agcgaccccc agacgccctt 600 
ggtctttaaa ggagtcgtct ttaatgagat gaagggagcg tttacagaca atgagaggat 660 
attctcccag caccttcaga acagacttct tcccgaccac acgtactcag tggtctccgg 720 
gggtgaccca ctgtgcatcc cggagcttac atgggagcag cttaagcagt ttcatgccac 780 
tcactatcac ccaagcaatg ctaggttctt cacgtacggt aattttccat tagaacagca 840 
tctgaaacaa attcacgagg aagcactgag caaattccag aaaattgaac caagcaccgt 900 
ggtgccagct cagacaccct gggacaagcc tagggaattc cagataacat gtggcccgga 960 
ttcatttgct acagatccct ctaaacaaac aaccgtcagc gttagcttcc tcttaccgga 1020 
catcaccgac acatttgaag ccttcacatt aagtcttctg tcttcactct tgacttctgg 1080 
gcccaattct cccttttaca aagccttgat tgaatctggc cttggcacag aattttctcc 1140 
tgatgttgga tataatggct acacgaggga ggcctacttt agtgtcggcc tccaagggat 1200 
tgtggagaaa gacattgaga ccgtcagaag cctcatagac agaacgattg atgaagtagt 1260 
tgagacaagg attgaagatg atcgaattga ggctttactt cataaaattg aaatacagat 1320 
gaaacatcag tctaccagct ttgggctgat gctgacatca tacatagctt cttgctggaa 1380 
ccatgatggg gaccctgtgg agctcttgaa gttgggaaat cagttagcta aattcagaca 1440 
gtgcctgcag gaaaatccaa aatttttgca agaaaaagta aaacagtatt ttaagaataa 1500 
ccagcataag ctgactttat cgatgaggcc agatgacaag tatcacgaga agcaggcaca 1560 
ggtggaagcc acgaagctca agcagaaggt cgaggctctg tcccccggag acaggcagca 1620 
gatctacgag aaaggtctag aattacggag tcaacaaagc aaacctcaag atgcctcttg 1680 
tctgccagcg ttgaaagttt ccgatattga acccaccata cctgtcacag agttggacgt 1740 
ggtcctgaca gctggagata tccctgttca gtactgcgcc cagcccacca atggcatggt 1800 
gtatttccgg gccttctcca gcctgaacac actccccgag gagctgaggc cctatgtgcc 18 60 
cctcttctgc agcatcctca ccaagctggg ctgcggcctt cttgactacc gggagcaggc 1920 
tcagcagata gaattgaaga ccggagggat gagtgcttct ccccacgtgc tccccgacga 1980 
ctcacacatg gacacctacg agcaggtagg tgtgcttttc tcctctctct gcctggatcg 2040 
aaacctgcca gacatgatgc agctatggag tgaaatattt aacaacccgt gctttgaaga 2100 
agaggagcac ttcaaggtgc tggtgaagat gaccgcccag gagctcgcca atggaattcc 2160 
tgactctggg cacctgtacg catccatcag ggcaggccgg accctcacgc ccgcagggga 2220 
cctgcaggag accttcagcg ggatggatca ggtgcggctg atgaagagga ttgcagaaat 2280 
gacagatatc aaacccatcc tgaggaagct cccgcgtatc aagaaacact tgttaaatgg 2340 
tgataatatg aggtgttcag tgaatgcgac tcctcagcag atgcctcaga cagaaaaagc 2400 
ggtcgaagac ttccttagaa gcatcggtcg gagtaaaaag gaacggaggc ctgtgcgccc 2460 
acacacggtc gagaaacctg tgcccagcag ctctggtgga gatgcccacg ttccccatgg 2520 
ctcccaggtc attaggaagc tggtcatgga acccaccttc aagccctggc agatgaagac 2580 
tcacttcctg atgcccttcc cggtgaatta cgtgggtgaa tgcatccgaa ctgtccccta 2640 
cacggaccca gatcatgcca gtcttaaaat ccttgcacgt ttgatgactg ccaaattctt 2700 
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gcatacagaa attcgagaaa aaggcggtgc ttatggtgga ngcgcaaaac tcagccacaa 27 60 

tgggattttc accctttact cttacaggga cccaaataca atagagacgc tccagtcttt 2820 

tgggaaggct gtcgactggg ctaagtctgg aaaattcaca cagcaagaca tcgacgaagc 2880 

caaactttct gtcttctcaa ccgtagatgc tcctgtcgct ccttcagaca aaggaatgga 2940 

ccacttcttg tacggcctct cggatgagat gaagcaggcc cacagagagc agctctttgc 3000 

tgtcagccac gacaagctcc tggccgtgag cgataggtac ctcggcactg ggaagagcac 3060 

acacggcctg gccatcctcg gacccgagaa cccgaaaatt gccaaggacc catcctggat 3120 

catccgatga gcagccgtgg cgctcgactg cacaggagcc cgagacaata cacctccaag 3180 

ctgaatatga aaagtcagaa atgctactgc tttttccaag aatattatgt cattgagtgt 3240 

cgccaaagcc cttgactggc gagtcaaaaa ctcagatcta tcttaagagt gaccaggaag 3300 

aggttcattg aaataatcat gcatgaagcg ccaaagatgc accatgtaga attttcactt 3360 

tgtactggca ggctcgtttt acctgattct agaatattta agaatctaaa aataaagggc 3420 

aactctgact ta 3432 

<210> 417 
<211> 817 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_006681 
<400> 417 

agtcctgcgt ccgggccccg aggcgcagca gggcaccagg tggagcacca gctacgcgtg 60 
gcgcagcgca gcgtccctag caccgagcct cccgcagccg ccgagatgct gcgaacagag 120 
agctgccgcc ccaggtcgcc cgccggacag gtggccgcgg cgtccccgct cctgctgctg 180 
ctgctgctgc tcgcctggtg cgcgggcgcc tgccgaggtg ctccaatatt acctcaagga 24 0 
ttacagcctg aacaacagct acagttgtgg aatgagatag atgatacttg ttcgtctttt 300 
ctgtccattg attctcagcc tcaggcatcc aacgcactgg aggagctttg ctttatgatt 360 
atgggaatgc taccaaagcc tcaggaacaa gatgaaaaag ataatactaa aaggttctta 420 
tttcattatt cgaagacaca gaagttgggc aagtcaaatg ttgtgtcgtc agttgtgcat 480 
ccgttgctgc agctcgttcc tcacctgcat gagagaagaa tgaagagatt cagagtggac 540 
gaagaattcc aaagtccctt tgcaagtcaa agtcgaggat attttttatt caggccacgg 600 
aatggaagaa ggtcagcagg gttcatttaa aatggatgcc agctaatttt ccacagagca 660 
atgctatgga atacaaaatg tactgacatt ttgttttctt ctgaaaaaaa tccttgctaa 720 
atgtactctg ttgaaaatcc ctgtgttgtc aatgttctca gttgtaacaa tgttgtaaat 780 
gttcaatttg ttgaaaatta aaaaatctaa aaataaa 817 

<210> 418 

<211> 5210 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_007203 
<400> 418 

ccagaggctg agcccgggcg agcccgccgt gcgcacagct ctgcccgccg cccctggagc 60 

ggatccccag ccgtccctgg gcgttctccc gggttctagc aaagcccccc ggggtgccca 120 

tcagtttcct tgggtgacta cagcgtgtgt tttttctttc ctctttcccc tgcctgtgtg 180 

cccttctcca ggatggcaga ggcggaattg cacaaggaaa ggctgcaagc catagcagaa 240 

aaaagaaaga ggcagactga aatagaaggc aagcgacaac agcttgacga gcagatactt 300 

ctgctgcagc attccaagtc caaagtgctt cgggagaaat ggctgctgca gggcataccc 360 

gctggaactg ccgaagagga ggaagccagg aggcggcagt ctgaagagga tgagttcaga 420 

gtcaagcaac ttgaagataa cattcagagg ctggagcaag aaatacaaac gctagaaagt 480 

gaagagtccc agatatctgc caaagagcaa atcatcctag agaaactgaa ggaaacagaa 540 

aaatccttca aggactttca gaagggtttc tccagtacgg atggagatgc agtaaattac 600 

atttcctccc agcttcccga cctgccaatc ctctgttcac gaacagcaga accatcacct 660 

gggcaggacg ggaccagcag agcggctgga gtcgggtggg agaatgtgct gctaaaggaa 720 

ggtgagtcag cctcgaacgc cacagaaaca tccggcccag acatgactat caagaagcct 780 
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ccccagcttt ctgaggatga tatctggcta 
ctcctggagc ctgctgccag ctctctttcc 
tctgttgcag aatgtaaaag tgttcctgga 
ccctccgctt tctattcacc cccgcataat 
gataatgatg ttgccagaga gatccgctat 
gattctgctg tggatggaac gtacaatgga 
ctggtgggcg gcctaagccc ccctgtccac 
acagctagcc ggcaggcacc tcctcacatc 
gcagaggcag aaagaacaaa tggccattcc 
agcctgcagg tgcctgtcag cccca'gctcc 
ggagagttca ctctcaccac actgaaaaag 
gaggacaaga agccctccaa gctctttgag 
attagaaaag tgaggccttc agaggagatg 
atccgcagcc aggccgtcaa gaagaatcct 
caggaaaaaa ccatcgagga gcagctggac 
aaggagcgca aagagagaag ggcacagcag 
cagcagcagc agcagccccc atcgcagctc 
gcaagcatga ttgacaaagc aaaggaggac 
gctcgcaaac aatttcagct gatggagaat 
acacccaggc tgttctccat caagcctttc 
aagccactga ctaatccgag accaccttct 
tcagccgcca agggacagaa atcccccggt 
cagggcaaca cagcctctca ggggaaggaa 
cccttatcta aactgtgggc tgaggatgga 
gtggtcaagg atgatgacca tgggattttg 
ttgacccaag aggagcttga ctctggtctg 
acagtcctgg agaccctatc caatgatttc 
tccaatgaga caaccaatgc cctccaggaa 
acaccacaaa ctgacaaccc ctcagagggc 
gatcatggtt tctattcccc ttcctocacg 
ttggagtatc aggctggcct cctggtgcag 
gtggataaag ctgtgtccaa aaccagcagg 
actgtagagg aagctgaagc tgcggctttc 
ccacctcagg tgtcttctcc tgttcaagag 
gctgaagaca gggcgctcag ggaaaggggg 
agtggcccga ttaacatgga ggagaccagg 
gaggcagctg agctgagaag cacagcctcc 
gttgggcctt tcaagctgag gtccaggaaa 
atccgagcag ctcaggaaag ggaagaggag 
acgcagagcc ccaggacaaa gaatgcccca 
gctccaggga aaatagagaa agtcaaacct 
ctgcagcctg acttagcccc tgaagaggct 
cagaccctca tggaagacta tgagacacac 
agtagttaca cttctaagtt actgtcttgc 
cgggttaatc gaagaaagag cgcactggct 
caggaggaag aagacaacga ataaacttcc 
gagaccaaga aaccaagaaa ttaacaactg 
gcaaacaaac tcagcaatcc ttatgtagac 
aaacgaaaag gaagtccccc catcagactc 
ttttttggtg actcaatccc agtgtctggt 
acaaaacatg ttacaagcaa atcaacatag 
ctgggcgatg aaggacaatg gtggtagggc 
gcagcctaca tggcacagga tatttccctg 
cagagggtgt tgcattaact ccattttctc 
ccatatttgc ccagaagagg cagttatgat 
ttgagaacaa ctcatgatca caattcataa 
gtcacaaaca caatctattc tacatgagga 
aaatggatga tgttagcttt aggatcgtgt 
atgagaacac attttcacag ttatgtttct 
actattttta tgatgggatg ggggagtata 
acttggttca agatggtggg ggcattttta 



aaaagcgagg gagacaacta tagtgccacc 840 
ccagatcaca aaaacatgga aattgaggtg 900 
atcacctcta ccccacatcc catggaccat 960 
ggcctcctta ctgatcacca cgaatccctg 1020 
ctagatgagg tgctagaggc caactgctgt 1080 
acatcctccc cagagcctgg tgcagtggtt 1140 
gaggcgaccc agccagaacc cactgaaaga 1200 
gagctcagta atagcagccc tgaccccatg 1260 
cccagccagc ctagagatgc gctgggggac 1320 
acaaccagct cacggtgttc ttcccgagat 1380 
gaggccaagt ttgagctgcg tgccttccat 1440 
gatgacgagc atgagaaaga acaatactgc 1500 
ctggagctgg aaaaggagag gagagagctc 1560 
ggcattgcag caaaatggtg gaatcccccg 1620 
gaggaacatc tggagtcgca caaaaagtac 1680 
gaacagttgc tgctgcagaa gcagttacag 1740 
tgcacagccc ctgcctcttc tcatgaacgc 1800 
attgtcacag agcagataga tttctctgct 1860 
tccaggcaag cggtggccaa gggccagagt 1920 
tacaggcctc tggggtcagt caactcagac 1980 
gtcgggggac ctccagaaga cagtggtgcc 2040 
gccctggaga ccccatcggc agcaggaagc 2100 
gggccctaca gcgagccttc taaacgtggg 2160 
gaatttacga gcgcccgggc tgtcctcact 2220 
gatcagttct caagatctgt caatgtctcc 2280 
gacgaattgt cggtgaggtc tcaggatacc 2340 
agcatggaca acatcagtga cagcggggca 2400 
aattcactgg ctgatttttc tctgccccag 2460 
cgaggagaag gcgtctccaa gtcatttagt 2520 
ctgggggact ctccgttggt tgatgacccc 2580 
aatgccattc aacaagccat agccgagcag 2640 
gatggagcag agcaacaggg acctgaagcg 2700 
ggctcagaaa agcctcagag catgtttgag 27 60 
aaaagggatg tattaccaaa gattctgcct 2820 
cccccccagc cactgccagc tgtgcagccc 2880 
cccgaaggaa gctatttcag caagtactcg 2940 
ctcctggcca ctcaagaatc tgacgtgatg 3000 
cagcggactt tgtccatgat tgaggaagag 3060 
ctgaagaggc agagacaagt cttgcagagt 3120 
tcactgccct ccagaacatg ctacaaaact 3180 
cctccatccc ccaccactga aggccccagc 3240 
gccggaaccc agcggcccaa gaatctgatg 3300 
aaatctaaaa ggcgcgagag aatggatgat 3360 
aaggtgactt ccgaggtcct cgaggccaca 3420 
ttgcgctggg aagcagggat ctatgccaac 3480 
ttcaacccag gaagcgtctt tggtgcttgg 3540 
aaagcatttt aatggactat ttattaaagt 3600 
cagaagctgc aatatacaat gatgaaaatg 3660 
tggacaccca ggaatcaaaa gagaaaggac 3720 
ttggggccaa tctacaatgg atccaaaagg 3780 
ttgctgcagg cggaactgag accagcctct 3840 
cacttagaga gcagtagggc ccaggctggt 3900 
caccagtttc ccacctgtca cttgcagcat 3960 
cagcagccat aaaatgaatg gcgtttctct 4020 
tccacttctc ctacatttaa ttatttttaa 4080 
aatccatatt ttaagacaat agttcttcat 4140 
ggctgagaaa ggggaagaga aaaacaaacc 4200 
ctgcttataa tattcactga tctgcagttt 4260 
tcataggaaa actatattta gtgggcaaca 4 320 
tacacgtata gaatctgtac gcgttgaaca 4380 
gagcggcaat aattgaaaaa aaaggcgaac 4440 



192 



WO 2006/084272 



PCT/LS2006/004280 



tctgccttgg agaggtagat gataagaaat aaaaaggtgt ttataactat tttgtattat 4500 
aaagtgggcc ttagagatag gaagaagaat gatggattcc ttttggatca atcagaaagg 4 560 
aaacacgaaa gaaaagtcag gaaggtagag agagaaaaag ggagggaagg agaaagaatg 4 620 
ggaataaaat aaggaggtaa gagatactat ttttgctgag caaccagtgt gtttcaggat 4 680 
gatacaaaga aaaatataga atagaaataa gtgcaggctt ggaatcagct acaaatccta 4740 
aagatggggt gtgtgtggat gtgtgtgtgt gtgtgtgtac accattgtgt gtttgtaaaa 4 800 
tgtgtatgtt catgagtaag ggtgtgtgtg tgtgtgtatt aaaattccag agtgaccgtg 4860 
gcacttgggt gtacaggtaa ttcctccaga gctgtttgct ggcttcagga gtggagtgag 4 920 
aatttctttt ttatgaaaag ggatataaag gcaccgagct gatgcagtat ttgtaatatt 4 980 
aagttgacct aacaaggtat ttgcatgagt cacaattaca aagttttgag cggttttgta 5040 
atttgacatt taggaaagtc tcctatttat tctcatactt tacattcatg cttagtatac 5100 
tatagaggat gccagcttta atctttctgt catttaaagc aatatgataa gggtattcaa 5160 
taattgggtg ccctaaattt ctggatgaga aaattttcaa ttctggccag 5210 



<210> 419 
<211> 449 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig63102_RC 
<400> 419 

ctatcccatt ttttcattta tttttatgag gagcctcaaa aaatagaagt ggccttggta 60 
gggaaggaat gtcgtgattc tgagtacagt atggcaaatt cttttgcctc agtttcccta 120 
actccatcca gtgcatatga aaatgtctat gtgcgtacaa ggtaggtctg gactaagctc 180 
tggccctccg gttgttttcc cattgcggga gcctaagcca gggtgcccag gactcctgtg 240 
tggctggtgg gcctgatgcc cacagctgat gattcaggaa ggaggggcct ggagtctgct 300 
gcagactctc ccgcccccta cagcccaaac cgggcactgc agcaatgagg tggtcagtcc 360 
agggagaggt ccgacaagtt ctcggcacaa tgctgcagga agtcaaagtc aggcacggag 420 
aagggcacgc gggaccactt tttggcctg 449 

<210> 420 
<211> 3044 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_003981 
<400> 420 

gtttgcgggt ggttgttgct ctcggggccg tgtggagtag gtctggacct ggactcacgg 60 

ctgcttggag cgtccgccat gaggagaagt gaggtgctgg cggaggagtc catagtatgt 120 

ctgcagaaag ccctaaatca ccttcgggaa atatgggagc taattgggat tccagaggac 180 

cagcggttac aaagaactga ggtggtaaag aagcatatca aggaactcct ggatatgatg 240 

attgctgaag aggaaagcct gaaggaaaga ctcatcaaaa gcatatccgt ctgtcagaaa 300 

gagctgaaca ctctgtgcag cgagttacat gttgagccat ttcaggaaga aggagagacg 360 

accatcttgc aactagaaaa agatttgcgc acccaagtgg aattgatgcg aaaacagaaa 420 

aaggagagaa aacaggaact gaagctactt caagagcaag atcaagaact gtgcgaaatt 480 

ctttgtatgc cccactatga tattgacagt gcctcagtgc ccagcttaga agagctgaac 54 0 

cagttcaggc aacatgtgac aactttgagg gaaacaaagg cttctaggcg tgaggagttt 600 

gtcagtataa agagacagat catactgtgt atggaagaat tagaccacac cccagacaca 660 

agctttgaaa gagatgtggt gtgtgaagac gaagatgcct tttgtttgtc tttggagaat 720 

attgcaacac tacaaaagtt gctacggcag ctggaaatgc agaaatcaca aaatgaagca 780 

gtgtgtgagg ggctgcgtac tcaaatccga gagctctggg acaggttgca aatacctgaa 84 0 

gaagaaagag aagctgtggc caccattatg tctgggtcaa aggccaaggt ccggaaagcg 900 

ctgcaattag aagtggatcg gttggaagaa ctgaaaatgc aaaacatgaa gaaagtgatt 960 

gaggcaattc gagtggagct ggttcagtac tgggaccagt gcttttatag ccaggagcag 1020 

agacaagctt ttgccccttt ctgtgctgag gactacacag aaagtctgct ccagctccac 1080 
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gatgctgaga ttgtgcggtt aaaaaactac tatgaagttc acaaggaact ctttgaaggt 1140 
gtccagaagt gggaagaaac ctggaggctt ttcttagagt ttgagagaaa agcttcagat 1200 
ccaaatcgat ttacaaaccg aggaggaaat cttctaaaag aagaaaaaca acgagccaag 1260 
ctccagaaaa tgctgcccaa gctggaagaa gagttgaagg cacgaattga attgtgggaa 1320 
caggaacatt caaaggcatt tatggtgaat gggcagaaat tcatggagta tgtggcagaa 1380 
caatgggaga tgcatcgatt ggagaaagag agagccaagc aggaaagaca actgaagaac 14 40 
aaaaaacaga cagagacaga gatgctgtat ggcagcgctc ctcgaacacc tagcaagcgg 1500 
cgaggactgg ctcccaatac accgggcaaa gcacgtaagc tgaacactac caccatgtcc 1560 
aatgctacgg ccaatagtag cattcggcct atctttggag ggacagtcta ccactccccc 1620 
gtgtctcgac ttcctccttc tggcagcaag ccagtcgctg cttccacctg ttcagggaag 1680 
aaaacacccc gtactggcag gcatggagcc aacaaggaga acctggagct caacggcagc 1740 
atcctgagtg gtgggtaccc tggctcggcc cccctccagc gcaacttcag cattaattct 1800 
gttgccagca cctattctga gtttgcgaag gatccgtccc tctctgacag ttccactgtt 1860 
gggcttcagc gagaactttc aaaggcttcc aaatctgatg ctacttctgg aatcctcaat 1920 
tcaaccaaca tccagtcctg agaagccctg atcagtcaac cagctgtggc ttcctgtgcc 1980 
tagactggac ctaattatat gggggtgact ttagtttttc ttcagcttag gagtgcttga 2040 
aaccttggcc aggttccatg accatgggcc taacttaaag atgtgaatga gtgttacagt 2100 
tgaaagccca tcataggttt agtggtccta ggagacttgg ttttgactta tatacatgaa 2160 
aagtttatgg caagaagtgc aaattttagc atatggggcc tgacttctct accacataat 2220 
tctacttgct gaagcatgat caaagcttgt tttatttcac cactgtagga aaatgattga 2280 
ctatgcccat ccctgggggt aattttggca tgtatacctg taactagtaa ttaacatctt 2340 
ttttgtttag gcatgttcaa ttaatgctgt agctatcata gctttgctct tacctgaagc 2400 
cttgtcccca ccacacagga cagccttcct cctgaagaga atgtctttgt gtgtccgaag 2460 
ttgagatggc ctgccctact ggcaaagagg tgacaggaag gctgggagca gctttgttaa 2520 
attgtgttca gttctgttac acagtgcatt gccctttgtt gggggtatgc atgtatgaac 2580 
acacatgctt gtcggaacgc tttctcggcg tttgtccctt ggctctcatc tcccccattc 2640 
ctgtgcctac tttgcctgag ttcttctacc cccgcagttg ccagccagat tgggagtctg 2700 
tttgttccaa tgggttgagc tgtctttgtc gtggagatct ggaactttgc acatgtcact 2760 
actggggagg tgttcctgct ctagcttcca cgatgaggcg ccctctttac ctatcctctc 2820 
aatcaotact cttcttgaag cactattatt tattcttccg ctgtctgcct goagcagtac 2880 
tactgtcaac atagtgtaaa tggttctcaa aagcttacca gtgtggactt ggtgttagcc 2940 
acgctgttta cctcatacag tacgtgtcct gtttttaaaa tatacaatta ttcttaaaaa 3000 
taaattaaaa tcgtatactt acatttcaaa aaaaaaaaaa aaaa 304 4 

<210> 421 
<211> 407 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig20217 
<400> 421 

agtgttacag acagatattg tgaaattttt ccttgggctt ctttcaattt cgttcaattc 60 
tcgtccagta ttctcataac atttggatca ggtactttat caccttgtag tgtcactgtt 120 
cctttctgta tcctctcgaa gcctcctgtt actcgttttc taattctctt ccagcccact 180 
gcacgatcta gtgtcgaatg gatgaaacag ctgagcagaa cagttcttta gctgtcggcc 240 
taaatgacca acttcatgac ctctcaaggt ctcgctcccc ggctgcacgc agctcagctt 300 
ccaacattcg ggaaaggcgt gccggatcgc gaggggccgc tgggaccgag tccggacgtc 360 
gcgacgcgca gggggctgct gggccggcgg tctcgggaaa agtgagg 407 

<210> 422 

<211> 1170 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig2399_RC 
<400> 422 

aatgtttcaa caacatttat acatcttttt aattagatta gctttacaag caacttgaga 60 
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gctctttcat aatgaaacac tgcttttcaa attacatgac tttatagtct atctgtatat 120 
gaatcctgaa aatctttgcc tactcttgat aaaggtctgt gttttacagc tggttaatgt 180 
gttgaataaa aacagcatta tgaagcttta tctttaacca ggattaaaca catataggcc 240 
acaaacagtt ttaaattttg tttgaaatag agtcctgctt ggtgacaagt taatagtccc 300 
actaacatga aatgaatgga tattaaataa atggccatcc tgatttcttg atcttttcac 360 
aatgattcct aagagagtat ataaactttt ccaaaaaata tacaatttaa gacctgtctg 420 
gcaagcacaa cccacagata gtagaatcta atacttttta caactgtaga ggtacacatt 4 80 
tgaatgacaa caggggctcc caaattaaaa ggctgtggct gtacattttg acaatcaaac 540 
aaacctccca tcaacataaa acaatttcta ctaaccctaa aaaagataat gcttttgagt 600 
tgtggcctaa cgccgcgcac tgtactatag accagatggc aaagacaact tgcgctcctg 660 
ggttgggctg gcagaggcac ttactgtaga gatgtgatgt tatcatctgg aaggaagggg 7 20 
aggaagtaga acaaaactga atatgggcga ggaaaaaaca ataaactatc actttctcct 7 80 
gcaaattctc ctaagtgaga tcactatact gaacattcaa tcccatccta agtccactgt 840 
tggcttcact acacaatttt ttttccatgt tttacatgaa tagcaatggc aatcttttaa 900 
taaaaattac tgagcttccc atagaaaagg tctcaaattg aatacaaact atacagatgc 960 
taaaaatgtc ttaagttgaa atatacagaa aatttctgta tactcacatt ttggcaaaga 1020 
agatcaagga cagagataga aaacctaagt gggcaaggtc ttacacccat aattcacgca 1080 
aaatatggat accacccttt aggttttgct aaaaagccta ggaaagtgaa gtctaagaca 1140 
tccgtcccaa ttcattcttt cctcgtgccg 1170 

<210> 423 

<211> 2019 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_007036 
<400> 423 

tgggtcacgg ctgcttccca ccagcaaaga ccacgactgg agagccgagc cggaggcagc 60 
tgggaaacat gaagagcgtc ttgctgctga ccacgctcct cgtgcctgca cacctggtgg 120 
ccgcctggag caataattat gcggtggact gccctcaaca ctgtgacagc agtgagtgca 180 
aaagcagccc gcgctgcaag aggacagtgc tcgacgactg tggctgctgc cgagtgtgcg 240 
ctgcagggcg gggagaaact tgctaccgca cagtctcagg catggatggc atgaagtgtg 300 
gcccggggct gaggtgtcag ccttctaatg gggaggatcc ttttggtgaa gagtttggta 360 
tctgcaaaga ctgtccctac ggcaccttcg ggatggattg cagagagacc tgcaactgcc 420 
agtcaggcat ctgtgacagg gggacgggaa aatgcctgaa attccccttc ttccaatatt 480 
cagtaaccaa gtcttccaac agatttgttt ctctcacgga gcatgacatg gcatctggag 540 
atggcaatat tgtgagagaa gaagttgtga aagagaatgc tgccgggtct cccgtaatga 600 
ggaaatggtt aaatccacgc tgatcccggc tgtgatttct gagagaaggc tctattttcg 660 
tgattgttca acacacagcc aacattttag gaactttcta gatatagcat aagtacatgt 720 
aatttttgaa gatccaaatt gtgatgcatg gtggatccag aaaacaaaaa gtaggatact 780 
tacaatccat aacatccata tgactgaaca cttgtatgtg tttgttaaat attcgaatgc 840 
atgtagattt gttaaatgtg tgtgtatagt aacactgaag aactaaaaat gcaatttagg 900 
taatcttaca tggagacagg tcaaccaaag agggagctag gcaaagctga agaccgcagt 960 
gagtcaaatt agttctttga ctttgatgta cattaatgtt gggatatgga atgaagactt 1020 
aagagcagga gaagatgggg agggggtggg agtgggaaat aaaatattta gcccttcctt 1080 
ggtaggtagc ttctctagaa tttaattgtg cttttttttt ttttttggct ttgggaaaag 1140 
tcaaaataaa acaaccagaa aacccctgaa ggaagtaaga tgtttgaagc ttatggaaat 1200 
ttgagtaaca aacagctttg aactgagagc aatttcaaaa ggctgctgat gtagttcccg 1260 
ggttacctgt atctgaagga cggttctggg gcataggaaa cacatacact tccataaata 1320 
gctttaacgt atgccacctc agagataaat ctaagaagta ttttacccac tggtggtttg 1380 
tgtgtgtatg aaggtaaata tttatatatt tttataaata aatgtgttag tgcaagtcat 1440 
cttccctacc catatttatc atcctcttga ggaaagaaat ctagtattat ttgttgaaaa 1500 
tggttagaat aaaaacctat gactctataa ggttttcaaa catctgaggc atgataaatt 1560 
tattatccat aattatagga gtcactctgg atttcaaaaa atgtcaaaaa atgagcaaca 1620 
gagggacctt atttaaacat aagtgctgtg acttcggtga attttcaatt taaggtatga 1680 
aaataagttt ttaggaggtt tgtaaaagaa gaatcaattt tcagcagaaa acatgtcaac 174 0 
tttaaaatat aggtggaatt aggagtatat ttgaaagaat cttagcacaa acaggactgt 1800 
tgtactagat gttcttagga aatatctcag aagtatttta tttgaagtga agaacttatt 1860 
taagaattat ttcagtattt acctgtattt tattcttgaa gttggccaac agagttgtga 1920 
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atgtgtgtgg aaggcctttg aatgtaaagc tgcataagct gttaggtttt gttttaaaag 1980 
gacatgttta ttattgttca ataaaaaaga acaagatac 2019 

<210> 424 

<211> 1720 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_018354 
<400> 424 

gactgcagat gaaattagta actggtgggg tcgtggggtg tgaatggtgg gcgggagcag 60 
ctatgtcagt tggtgtgttt ctgcttatgt tagggtaatt gggcacggcc tttgtgtaac 120 
tggtgaatat ctctgaacct gggcatgaaa cagagagatg tcctaactct gggtgagagg 180 
aatcctcatt tttctctgcc ctctcactgt ggcatcctaa gaaaaaagtt ttgggttcct 240 
gcagcatgaa ggagagctct gctcccagaa tttgggagct ccagatttct tccagggtgt 300 
ggaggcatca atatatcagt ctgggaaagg ggttcctggg ccactccagg agctgagttg 360 
ggtggaaggt gctgagagtg tgggtggggg ccacttctga gcacccatgt ggcacccact 420 
gctggtccct gtttgtggct gggcactcag gaaaatgttt ttggtgctaa gagtaaaaag 4 80 
ccaaccaaca aacacatctc ttttttctgt ctattcactg gaaagtaaaa gcagtctggg 540 
cgcaggctgg ggacccagat ggaattcaaa cttatgcctg ctctcaaggt gctcacggtt 600 
gctgataaac agctggataa aatgaagagt ctatgagtga gggatgcaga gccagggaag 660 
gctggtggag tgatgccacc agcacagggg tatgagtttg cagctgccaa ggggccaagg 720 
gatgagctgg ggccctcctt cccaatggca tctccccctg gtctggaact gaagacactg 780 
agcaatggtc cccaagcccc aaggagatca gctcccctgg gcccagtggc cccaaccagg 840 
gagggtgtgg agaatgcctg cttctcctca gaggagcatg agacccattt ccagaaccct 900 
gggaacacga gactgggcag ctcacccagt ccccctgggg gtgtctcctc actgccccga 960 
tcccagcggg atgatctgtc ccttcattca gaggaggggc cagccctgga gcccgtgagc 1020 
cgcccggtgg attatggctt tgtttccgcc ctcgttttcc tggtgagtgg gattcttctg 1080 
gtggtgacag catacgccat cccccgtgag gctcgagtca atccggacac agtgacagcg 1140 
cgggagatgg aacgactgga gatgtactac gcccgcctag gctcccacct ggacaggtgc 1200 
atcatcgcag gcctcgggct gctcacggtg ggcggcatgc tcttgtcggt gctgctcatg 1260 
gtctccctgt gcaagggcga gctgtaccgc cggaggacct tcgtccccgg caagggctcc 1320 
aggaagacct acggctccat taacctgcgc atgagacagc tcaatgggga tgggggccag 1380 
gccctggtgg agaatgaagt tgtccaggtc tcagagacta gccacaccct ccagaggtct 1440 
taagaactag cccaccttat ctggctgctt tagctccagt gctacaaggt ccaccccctg 1500 
ctcccgccca cctgacccct gccaaggccc tggggtttta aactgagctc acatagggcc 1560 
ttgtggaaga agtactgggt gctggaggga gagctcgggg cccagcccat gccccacacg 1620 
ggcaagcagc ccactgatct gttttgtagc tgaggttttg catacggttt tgtttggagg 1680 
atggcttctg ctgctaaaaa tacaaaagtt tggaaaccgc 1720 

<210> 425 
<211> 987 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NMJ300125 



<400> 425 

cggagagggg gagaacagac aacgggcggc 

cagcatggag ccttcggctg actggctggc 

ggtgcgggcg ctgctggagg cgggggcgct 

gccgatccag gtcatgatga tgggcagcgc 

cgcggagccc aactgcgccg accccgccac 

ggagggcttc ctggacacgc tggtggtgct 

cgatgcctgg ggccgtctgc ccgtggacct 

acggtacctg cgcgcggctg cggggggcac 

cgcggaaggt ccctcagaca tccccgattg 

ggaaacttag atcatcagtc accgaaggtc 



ggggagcagc atggagccgg cggcggggag 60 
cacggccgcg gcccggggtc gggtagagga 120 
gcccaacgca ccgaatagtt acggtcggag 180 
ccgagtggcg gagctgctgc tgctccacgg 240 
tctcacccga cccgtgcacg acgctgcccg 300 
gcaccgggcc ggggcgcggc tggacgtgcg 360 
ggctgaggag ctgggccatc gcgatgtcgc 420 
cagaggcagt aaccatgccc gcatagatgc 480 
aaagaaccag agaggctctg agaaacctcg 540 
ctacagggcc acaactgcoc ccgccacaac 600 
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ccaccccgct ttcgtagttt tcatttagaa 
aacgtagata taagccttcc cccactaccg 
ttcttataaa aatgtaaaaa agaaaaacac 
tctggagtga gcactcacgc cctaagcgca 
cagcctccgg aagctgtcga cttcatgaca 
ggttactggc ttctcttgag tcacactgct 
aataaaataa ttttcattca ttcactc 



aatagagctt ttaaaaatgt cctgcctttt 660 
taaatgtcca tttatatcat tttttatata 720 
cgcttctgcc ttttcactgt gttggagttt 780 
cattcatgtg ggcatttctt gcgagcctcg 84 0 
agcattttgt gaactaggga agctcagggg 900 
agcaaatggc agaaccaaag ctcaaataaa 960 
987 
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